Multi-source Multi-modal Activity Recognition in Aerial Video Surveillance

作者: Riad I Hammoud , Cem S Sahin , Erik P Blasch , Bradley J Rhodes

DOI: 10.1109/CVPRW.2014.44

关键词:

摘要: Recognizing activities in wide aerial/overhead imagery remains a challenging problem due part to low-resolution video and cluttered scenes with large number of moving objects. In the context this research, we deal two un-synchronized data sources collected real-world operating scenarios: full-motion videos (FMV) analyst call-outs (ACO) form chat messages (voice-to-text) made by human watching streamed FMV from an aerial platform. We present multi-source multi-modal activity/event recognition system for surveillance applications, consisting of: (1) detecting tracking multiple dynamic targets platform, (2) representing target tracks as graphs attributes, (3) associating using probabilistic graph-based matching approach, (4) spatial-temporal activity boundaries. also pattern learning framework which uses associated training index archive videos. Finally, describe multi-intelligence user interface querying interest (AOIs) movement type geo-location, playing-back summary text segments targets-of-interest (TOIs) (in both pixel geo-coordinates). Such tools help end-user quickly search, browse, prepare mission reports data.

参考文章(69)
Howard D. Wactlar, Pinar Duygulu, Associating video frames with text ,(2003)
Richard Cannata, Jay Hackett, Jeremy Jackson, Tariq Bakir, Ronald Alan Riley, Video summarization using video frames from different perspectives ,(2010)
James W. Owens, Casey L. Miller, Tony T. Di Croce, Jason A. Heddings, Steven D. Martin, Shu Yang, Greg Millar, Visual command processing ,(2012)
Osama Masoud, Benjamin Maurin, Nikos Papanikolopoulos, CAMERA SURVEILLANCE OF CROWDED TRAFFIC SCENES ITS America 12th Annual Meeting and Exposition: Securing Our FutureIntelligent Transportation Society of America (ITS America). ,(2002)
Tae Eun Choe, Hongli Deng, Mun Wai Lee, Feng Guo, Graph matching by sub-graph grouping and indexing ,(2014)
John Farneman, Mobile video surveillance system ,(2005)
Charles Eubank, Lynn D. Churchill, Joseph Kimmey, Intelligent notification system and method ,(2014)
Anthony Hoogs, Alex Aved, Genshe Chen, Haibin Ling, Dan Shen, Riad I. Hammoud, James Nagy, William M. Pottenger, Eric K. Jones, Arslan Basharat, Michael Schneider, Erik Blasch, Context aided video-to-text information fusion international conference on information fusion. pp. 1- 8 ,(2014)