Automatic MOOC Video Classification using Transcript Features and Convolutional Neural Networks

Authors: Houssem Chatbri, Kevin McGuinness, Suzanne Little, Jiang Zhou, Keisuke Kameyama

DOI: 10.1145/3132390.3132393

Keywords: Image (mathematics), Transformation (function), Speech recognition, Convolutional neural network, Supervised learning, Computer science, Digital video

Abstract: The amount of MOOC video materials has grown exponentially in recent years. Therefore, their storage and analysis need to be made as fully automated as possible in order to maintain their management quality. In this work, we present a method for automatic topic classification of MOOC videos using speech transcripts and convolutional neural networks (CNN). Our method works as follows: First, speech recognition is used to generate video transcripts. Then, the transcripts are converted into images using a statistical co-occurrence transformation that we designed. Finally, a CNN is used to produce video category labels for a transcript image input. For our data, we use the Khan Academy on a Stick dataset that contains 2,545 videos, where each video is labeled with one or two of 13 categories. Experiments show that our method is strongly competitive against other methods that are also based on transcript features and supervised learning.
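
The transcript-to-image step named in the abstract can be illustrated with a minimal sketch. The abstract does not define the authors' statistical co-occurrence transformation, so everything below is an assumption made for illustration: the function name `cooccurrence_image`, the hash-based word bucketing, the `size` and `window` parameters, and the normalization. The result is a small grid of co-occurrence intensities of the kind a CNN could take as input; it is not the transformation from the paper.

```python
from collections import Counter


def cooccurrence_image(transcript, size=16, window=2):
    """Turn a transcript into a size x size grid of values in [0, 1].

    Illustrative assumption only: words are hashed into `size` buckets,
    and cell (a, b) counts how often a word from bucket b follows a word
    from bucket a within `window` positions.
    """
    tokens = transcript.lower().split()
    # Hypothetical bucketing: a stable hash based on the sum of code points.
    buckets = [sum(ord(c) for c in tok) % size for tok in tokens]

    counts = Counter()
    for i, a in enumerate(buckets):
        for b in buckets[i + 1 : i + 1 + window]:
            counts[(a, b)] += 1

    # Scale by the largest count so every cell lies in [0, 1].
    peak = max(counts.values(), default=0)
    image = [[0.0] * size for _ in range(size)]
    for (a, b), n in counts.items():
        image[a][b] = n / peak
    return image


# Usage: a short made-up transcript snippet mapped to an 8 x 8 grid.
image = cooccurrence_image(
    "the derivative of the function is the slope of the tangent line",
    size=8,
    window=2,
)
```

Hashing words into a fixed number of buckets keeps the grid size independent of vocabulary size, at the cost of collisions between unrelated words; a real system would more likely use a fixed vocabulary or learned word groupings.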
