iCaps-Dfake: An Integrated Capsule-Based Model for Deepfake Image and Video Detection

作者: Sherin M. Youssef , Samar Samir Khalil , Sherine Nagy Saleh

DOI: 10.3390/FI13040093

关键词: Artificial neural networkComputer scienceLocal binary patternsThe InternetGeneralizationBenchmark (computing)Artificial intelligenceDeep learningFeature extractionConvolutional neural networkMachine learning

摘要: Fake media is spreading like wildfire all over the internet as a result of great advancement in deepfake creation tools and huge interest researchers corporations are showing to explore its limits. Now anyone can create manipulated unethical forensics, defame, humiliate others or even scam them out their money with click button. In this research new detection approach, iCaps-Dfake, proposed that competes state-of-the-art techniques video addresses low generalization problem. Two feature extraction methods combined, texture-based Local Binary Patterns (LBP) Convolutional Neural Networks (CNN) based modified High-Resolution Network (HRNet), along an application capsule neural networks (CapsNets) implementing concurrent routing technique. Experiments have been conducted on large benchmark datasets evaluate performance model. Several metrics applied experimental results analyzed. The model was primarily trained tested DeepFakeDetectionChallenge-Preview (DFDC-P) dataset then Celeb-DF examine capability. achieved Area-Under Curve (AUC) score improvement 20.25% models.

参考文章(49)
Geoffrey E. Hinton, Alex Krizhevsky, Sida D. Wang, Transforming auto-encoders international conference on artificial neural networks. pp. 44- 51 ,(2011) , 10.1007/978-3-642-21735-7_6
Karen Simonyan, Andrew Zisserman, Very Deep Convolutional Networks for Large-Scale Image Recognition computer vision and pattern recognition. ,(2014)
M Sai Praneeth, Xudong Peng, Alice Li, Shahrzad Hosseini Vajargah, Going deeper with convolutions computer vision and pattern recognition. pp. 1- 9 ,(2015) , 10.1109/CVPR.2015.7298594
, Generative Adversarial Nets neural information processing systems. ,vol. 27, pp. 2672- 2680 ,(2014) , 10.3156/JSOFT.29.5_177_2
Rukundo Olivier, Cao Hanqiang, Nearest neighbor value interpolation International Journal of Advanced Computer Science and Applications. ,vol. 3, pp. 25- 30 ,(2012) , 10.14569/IJACSA.2012.030405
Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun, Deep Residual Learning for Image Recognition computer vision and pattern recognition. pp. 770- 778 ,(2016) , 10.1109/CVPR.2016.90
Justus Thies, Michael Zollhofer, Marc Stamminger, Christian Theobalt, Matthias Niessner, Face2Face: Real-Time Face Capture and Reenactment of RGB Videos computer vision and pattern recognition. pp. 2387- 2395 ,(2016) , 10.1109/CVPR.2016.262
Zhifeng Li, Yu Qiao, Kaipeng Zhang, Zhanpeng Zhang, Joint Face Detection and Alignment using Multi-task Cascaded Convolutional Networks arXiv: Computer Vision and Pattern Recognition. ,(2016) , 10.1109/LSP.2016.2603342
Supasorn Suwajanakorn, Steven M. Seitz, Ira Kemelmacher-Shlizerman, Synthesizing Obama: learning lip sync from audio ACM Transactions on Graphics. ,vol. 36, pp. 95- ,(2017) , 10.1145/3072959.3073640