Finding Interesting Frames in Deep Video Analytics: a Top-K Approach

作者: Ben Kao , Eric Lo , Chris Liu , Ziliang Lai , Chenxia Han

DOI:

关键词:

摘要: Recently, the impressive accuracy of deep neural networks (DNNs) has created great demands on practical analytics over video data. Although efficient and accurate, latest analytic systems have not supported beyond simple queries like selection. In data analytics, Top-K is a very important analytical operation that enables analysts to focus most entities. this paper, we present Everest, first system supports accurate analytics. Everest ranks identifies interesting frames/moments from videos with probabilistic guarantees. built careful synthesis computer vision, machine learning, uncertain management, query processing. Evaluations five real-world Visual Road benchmark show achieves between 16.3x 20.6x higher efficiency than baseline approaches high result accuracy.

参考文章(78)
Chien-Chun Hung, Ganesh Ananthanarayanan, Peter Bodik, Leana Golubchik, Minlan Yu, Paramvir Bahl, Matthai Philipose, VideoEdge: Processing Camera Streams using Hierarchical Clusters information security. pp. 115- 131 ,(2018) , 10.1109/SEC.2018.00016
Aaron J. Elmore, Sanjay Krishnan, Adam Dziedzic, DeepLens: Towards a Visual Data Management System arXiv: Databases. ,(2018)
Brandon Haynes, Amrita Mazumdar, Magdalena Balazinska, Luis Ceze, Alvin Cheung, Visual Road: A Video Data Management Benchmark international conference on management of data. pp. 972- 987 ,(2019) , 10.1145/3299869.3324955
Ioannis Xarchakos, Nick Koudas, SVQ: Streaming Video Queries international conference on management of data. pp. 2013- 2016 ,(2019) , 10.1145/3299869.3320230
Shaoqing Ren, Kaiming He, Jian Sun, Xiangyu Zhang, Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification arXiv: Computer Vision and Pattern Recognition. ,(2015)
Matei Zaharia, Peter Bailis, Firas Abuzaid, Daniel Kang, John Emmons, NoScope: Optimizing Neural Network Queries over Video at Scale arXiv: Databases. ,(2017)
Qi Ye, Tae-Kyun Kim, Occlusion-Aware Hand Pose Estimation Using Hierarchical Mixture Density Network european conference on computer vision. pp. 817- 834 ,(2018) , 10.1007/978-3-030-01249-6_49
Peter Bodik, Matthai Philipose, Paramvir Bahl, Ganesh Ananthanarayanan, Phillip B. Gibbons, Onur Mutlu, Kevin Hsieh, Shivaram Venkataraman, Focus: querying large video datasets with low latency and low cost operating systems design and implementation. pp. 269- 286 ,(2018) , 10.5555/3291168.3291188
Tejaswi Potluri, Nitta Gnaneswara Rao, Content Based Video Retrieval Using SURF, BRISK and HARRIS Features for Query-by-image International Conference on Recent Trends in Image Processing and Pattern Recognition. pp. 265- 276 ,(2018) , 10.1007/978-981-13-9181-1_24
Kayvon Fatahalian, Maneesh Agrawala, Christopher Ré, Xinwei Yao, Anh Truong, Daniel Y. Fu, Will Crichton, Haotian Zhang, James Hong, Avanika Narayan, Rekall: Specifying Video Events using Compositions of Spatiotemporal Labels arXiv: Databases. ,(2019)