Weakly supervised pain localization using multiple instance learning

作者: Karan Sikka , Abhinav Dhall , Marian Bartlett

DOI: 10.1109/FG.2013.6553762

关键词:

摘要: Automatic pain recognition from videos is a vital clinical application and, owing to its spontaneous nature, poses interesting challenges automatic facial expression (AFER) research. Previous vs no-pain systems have highlighted two major challenges: (1) ground truth provided for the sequence, but presence or absence of target given frame unknown, and (2) time point duration event(s) in each video are unknown. To address these issues we propose novel framework (referred as MS-MIL) where sequence represented bag containing multiple segments, instance learning (MIL) employed handle this weakly labeled data form level ground-truth. These segments generated via clustering running multi-scale temporal scanning window, using state-of-the-art Bag Words (BoW) representation. This work extends idea detecting expressions through `concept frames' segments' argues extensive experiments that algorithms like MIL needed reap benefits such The key advantages our approach are: joint detection localization painful frames only sequence-level ground-truth, incorporation dynamics by representing not individual (3) extraction which well suited signals with uncertain location video. Experiments on UNBC-McMaster Shoulder Pain dataset highlight effectiveness achieving promising results problem videos.

参考文章(27)
Zara Ambadar, Simon Lucey, Patrick J. Lucey, Jeffrey Cohn, Sridha Sridharan, Jessica M. Howlett, Improving Pain Recognition Through Better Utilisation of Temporal Information. Proceedings of the International Conference on Auditory-Visual Speech Processing 2008. ,vol. 2008, pp. 167- 172 ,(2008)
Paul Wohlhart, Martin Köstinger, Peter M. Roth, Horst Bischof, Multiple Instance Boosting for Face Recognition in Videos Lecture Notes in Computer Science. pp. 132- 141 ,(2011) , 10.1007/978-3-642-23123-0_14
Takeo Kanade, Facial Expression Analysis Lecture Notes in Computer Science. pp. 1- 1 ,(2005) , 10.1007/11564386_1
Carolina Galleguillos, Boris Babenko, Andrew Rabinovich, Serge Belongie, Weakly Supervised Object Localization with Stable Segmentations european conference on computer vision. pp. 193- 207 ,(2008) , 10.1007/978-3-540-88682-2_16
Gerasimos Potamianos, Chalapathy Neti, Giridharan Iyengar, Andrew W. Senior, Ashish Verma, A Cascade Visual Front End for Speaker Independent Automatic Speechreading International Journal of Speech Technology. ,vol. 4, pp. 193- 208 ,(2001) , 10.1023/A:1011352422845
T. Serre, L. Wolf, T. Poggio, Object recognition with features inspired by visual cortex computer vision and pattern recognition. ,vol. 2, pp. 994- 1000 ,(2005) , 10.1109/CVPR.2005.254
Jerome H. Friedman, Greedy function approximation: A gradient boosting machine. Annals of Statistics. ,vol. 29, pp. 1189- 1232 ,(2001) , 10.1214/AOS/1013203451
Jinjun Wang, Jianchao Yang, Kai Yu, Fengjun Lv, Thomas Huang, Yihong Gong, Locality-constrained Linear Coding for image classification computer vision and pattern recognition. pp. 3360- 3367 ,(2010) , 10.1109/CVPR.2010.5540018
Tomas Simon, Minh Hoai Nguyen, Fernando De La Torre, Jeffrey F. Cohn, Action unit detection with segment-based SVMs computer vision and pattern recognition. pp. 2737- 2744 ,(2010) , 10.1109/CVPR.2010.5539998
Kenneth D. Craig, Susan A. Hyde, Christopher J. Patrick, Genuine, suppressed and faked facial behavior during exacerbation of chronic low back pain. Pain. ,vol. 46, pp. 161- 171 ,(1991) , 10.1016/0304-3959(91)90071-5