Learning laparoscopic video shot classification for gynecological surgery

Authors: Stefan Petscharnig, Klaus Schöffmann

DOI: 10.1007/S11042-017-4699-5

Keywords: Shot (filmmaking), Endoscopic surgery, Recall, Computer science, Contextual image classification, Medical research, Convolutional neural network, Natural language processing, Gynecological surgery, Endometriosis, Computer vision, Deep learning, Artificial intelligence, Myoma

Abstract: Videos of endoscopic surgery are used for the education of medical experts, for analysis in research, and for documentation in everyday clinical life. Hand-crafted image descriptors lack the capabilities for a semantic classification of surgical actions and of video shots of anatomical structures. In this work, we investigate how well single-frame convolutional neural networks (CNN) for shot classification in gynecologic surgery work. Together with medical experts, we manually annotate hours of raw videos showing endometriosis treatment and myoma resection of over 100 patients. The cleaned ground truth dataset comprises 9 h of annotated material (from 111 different recordings). We use the well-known CNN architectures AlexNet and GoogLeNet and train these for both surgical actions and anatomy from scratch. Furthermore, we extract high-level features using the weights of a pre-trained model from the Caffe model zoo and feed them to an SVM classifier. Our evaluation shows that we reach an average recall of .697 and .515 for anatomical structures and surgical actions respectively using off-the-shelf features. Using GoogLeNet, we achieve a mean recall of .782 and .617 respectively. With AlexNet, the achieved recall is .615 for anatomical structures and .469 for surgical actions. The main conclusion of our work is that advances in general image classification methods transfer to the domain of gynecology. This is relevant as this domain differs from natural images, e.g. it is distinguished by smoke, reflections, or a limited amount of colors.
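The recall figures in the abstract are reported as averages. A minimal sketch of one common reading, the unweighted mean of per-class recall, is given below. This reading is an assumption, since the abstract does not spell out the averaging scheme; the class labels and the helper names `per_class_recall` and `mean_recall` are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def per_class_recall(y_true, y_pred):
    """Return {class: recall}, where recall = correctly predicted / total samples of that class."""
    support = defaultdict(int)  # number of ground-truth samples per class
    hits = defaultdict(int)     # number of correct predictions per class
    for truth, pred in zip(y_true, y_pred):
        support[truth] += 1
        if truth == pred:
            hits[truth] += 1
    return {label: hits[label] / support[label] for label in support}


def mean_recall(y_true, y_pred):
    """Unweighted mean of per-class recall (assumed averaging scheme)."""
    recalls = per_class_recall(y_true, y_pred)
    return sum(recalls.values()) / len(recalls)


# Hypothetical per-frame labels; these are not the paper's classes or data.
y_true = ["suturing", "suturing", "cutting", "cutting", "cutting", "coagulation"]
y_pred = ["suturing", "cutting", "cutting", "cutting", "suturing", "coagulation"]

print(per_class_recall(y_true, y_pred))
print(round(mean_recall(y_true, y_pred), 3))
```

Averaging per class rather than per frame keeps frequent classes from dominating the score, which matters when some surgical actions cover far more video material than others.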

References (30)
Manfred Jurgen Primus, Klaus Schoeffmann, Laszlo Boszormenyi, Instrument classification in laparoscopic videos. Content-Based Multimedia Indexing, pp. 1-6 (2015), 10.1109/CBMI.2015.7153616
Joe Yue-Hei Ng, Matthew Hausknecht, Sudheendra Vijayanarasimhan, Oriol Vinyals, Rajat Monga, George Toderici, Beyond short snippets: Deep networks for video classification. Computer Vision and Pattern Recognition, pp. 4694-4702 (2015), 10.1109/CVPR.2015.7299101
Mandar Dixit, Si Chen, Dashan Gao, Nikhil Rasiwasia, Nuno Vasconcelos, Scene classification with semantic Fisher vectors. Computer Vision and Pattern Recognition, pp. 2974-2983 (2015), 10.1109/CVPR.2015.7298916
Bernd Munzer, Klaus Schoeffmann, Laszlo Boszormenyi, Relevance Segmentation of Laparoscopic Videos. International Symposium on Multimedia, pp. 84-91 (2013), 10.1109/ISM.2013.22
Andrej Karpathy, George Toderici, Sanketh Shetty, Thomas Leung, Rahul Sukthankar, Li Fei-Fei, Large-Scale Video Classification with Convolutional Neural Networks. Computer Vision and Pattern Recognition, pp. 1725-1732 (2014), 10.1109/CVPR.2014.223
Qing Li, Weidong Cai, Xiaogang Wang, Yun Zhou, David Dagan Feng, Mei Chen, Medical image classification with convolutional neural network. International Conference on Control, Automation, Robotics and Vision, pp. 844-848 (2014), 10.1109/ICARCV.2014.7064414
Klaus Schoeffmann, Manfred Del Fabro, Tibor Szkaliczki, Laszlo Böszörmenyi, Jörg Keckstein, Keyframe extraction in endoscopic video. Multimedia Tools and Applications, vol. 74, pp. 11187-11206 (2015), 10.1007/S11042-014-2224-7
Gwenole Quellec, Mathieu Lamard, Beatrice Cochener, Guy Cazuguel, Real-Time Segmentation and Recognition of Surgical Tasks in Cataract Surgery Videos. IEEE Transactions on Medical Imaging, vol. 33, pp. 2352-2360 (2014), 10.1109/TMI.2014.2340473
Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, Andrew Rabinovich, Going deeper with convolutions. Computer Vision and Pattern Recognition, pp. 1-9 (2015), 10.1109/CVPR.2015.7298594
Bernd Munzer, Klaus Schoeffmann, Laszlo Boszormenyi, J.F. Smulders, Jack J. Jakimowicz, Investigation of the Impact of Compression on the Perceptional Quality of Laparoscopic Videos. Computer-Based Medical Systems, pp. 153-158 (2014), 10.1109/CBMS.2014.58