Several fake face detectors based on convolutional neural network (CNN) have been reported to resist the impact of fake faces, but they all face a common problem that the intra-dataset test is generally with high accuracy, but the performance of crossdataset test drops significantly, which indicates low generalization ability. Based on thorough evaluations for five popular fake face detectors including MesoInception-4, MISLnet, ShallowNetV1, Inception-v3 and Xception, this paper completes both intra-dataset test and cross-dataset test on three fake face datasets. In experiment, the effects on generalization ability from of factors, such as dataset partition, data augmentation and threshold selection, are investigated.
[1] Blanz V, Scherbaum K, Vetter T, et al. Exchanging faces in images[J]. Computer Graphics Forum, 2004, 23(3):669-676.
[2] Agarwala A, Dontcheva M, Agrawala M, et al. Interactive digital photomontage[J]. ACM Transactions on Graphics, 2004, 23(3):294-302.
[3] Bitouk D, Kumar N, Dhillon S, et al. Face swapping:automatically replacing faces in photographs[J]. ACM Transactions on Graphics, 2008, 27(3):9:1-9:8.
[4] Williams L. Performance-driven facial animation[C]//ACM SIGGRAPH Computer Graphics. ACM, 1990, 24(4):235-242.
[5] Deng Z, Neumann U. Data-driven 3D facial animation[J]. Data Drivend Facial Animation, 2008:1-28.
[6] Ma W C, Jones A, Chiang J Y, et al. Facial performance synthesis using deformation-driven polynomial displacement maps[J]. ACM Transactions on Graphics, 2008, 27(5):121:1-121:10.
[7] Li H, Adams B, Guibas L J, et al. Robust single-view geometry and motion reconstruction[J]. ACM Transactions on Graphics, 2009, 28(5):175:1-175:10.
[8] Bradley D, Heidrich W, Popa T, et al. High resolution passive facial performance capture[J]. ACM Transactions on Graphics, 2010, 29(4):41:1-41:10.
[9] Beeler T, Hahn F, Bradley D, et al. High-quality passive facial performance capture using anchor frames[J]. ACM Transactions on Graphics, 2011, 30(4):75:1-75:10.
[10] Borshukov G, Piponi D, Larsen O, et al. Universal capture-image-based facial animation for the matrix reloaded[C]//ACM Siggraph 2005 Courses. ACM, 2005:16:1.
[11] Alexander O, Rogers M, Lambeth W, et al. The Digital Emily project:photoreal facial modeling and animation[C]//Acmsiggraph 2009 Courses. ACM, 2009:12:1-12:15.
[12] Vlasic D, Brand M, Pfister H, et al. Face transfer with multilinear models[J]. ACM Transactions on Graphics, 2005, 24(3):426-433.
[13] Dale K, Sunkavalli K, Johnson M K, et al. Video face replacement[J]. ACM Transactions on Graphics, 2011, 30(6):130:1-130:10.
[14] THIES J, Zollhöfer M, Nießner M, et al. Real-time expression transfer for facial reenactment[J]. ACM Transactions on Graphics, 2015, 34(6):183:1-183:14.
[15] Thies J, Zollhöfer M, Stamminger M, et al. Face2Face:real-time face capture and reenactment of RGB videos[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2016:2387-2395.
[16] Antipov G, Baccouche M, Dugelay J L. Face aging with conditional generative adversarial networks[C]//2017 IEEE International Conference on Image Processing (ICIP). IEEE, 2017:2089-2093.
[17] Tewari A, ZollhÖFer M, Kim H, et al. Mofa:Model-based deep convolutional face autoencoder for unsupervised monocular reconstruction[C]//Proceedings of the IEEE International Conference on Computer Vision, 2017:1274-1283.
[18] Nirkin Y, Masi I, Tuan A T, et al. On face segmentation, face swapping, and face perception[C]//2018 13th IEEE International Conference on Automatic Face and Gesture Recognition. IEEE, 2018:98-105.
[19] Kim H, Carrido P, Tewari A, et al. Deep video portraits[J]. ACM Transactions on Graphics, 2018, 37(4):163:1-163:14.
[20] Rössler a, Cozzolino D, Verdoliva L, et al. FaceForensics++:learning to detect manipulated facial images[DB/OL]. 2019[2019-01-25]. arXiv:1901.08971.
[21] Korshunov P, Marcel S. DeepFakes:a new threat to face recognition? assessment and detection[DB/OL]. 2018[2018-12-20]. arXiv:1812.08685.
[22] Khodabakhsh A, Ramachandra R, Raja K, et al. Fake face detection methods:can they be generalized?[C]//2018 International Conference of the Biometrics Special Interest Group (BIOSIG). IEEE, 2018:1-6.
[23] Afchar D, Nozick V, Yamagishi J, et al. Mesonet:a compact facial video forgery detection network[C]//IEEE International Workshop on Information Forensics and Security (WIFS). IEEE, 2018:1-7.
[24] Bayar B, Stamm M C. Constrained convolutional neural networks:a new approach towards general purpose image manipulation detection[J]. IEEE Transactions on Information Forensics and Security, 2018, 13(11):2691-2706.
[25] Tariq S, Lee S, Kim H, et al. Detecting both machine and human created fake face images in the wild[C]//The 2nd International Workshop on Multimedia Privacy and Security. ACM, 2018:81-87.
[26] Szegedy C, Vanhoucke V, Ioffe S, et al. Rethinking the inception architecture for computer vision[C]//IEEE Conference on Computer Vision and Pattern Recognition. 2016:2818-2826.
[27] Chollet F. Xception:deep learning with depthwise separable convolutions[C]//IEEE Conference on Computer Vision and Pattern Recognition, 2017:1251-1258.
[28] Szegedy C, Liu W, Jia Y, et al. Going deeper with convolutions[C]//IEEE Conference on Computer Vision and Pattern Recognition, 2015:1-9.
[29] Sanderson C, Lovell B C. Multi-region probabilistic histograms for robust and scalable identity inference[C]//International conference on biometrics. Springer, 2009:199-208.
[30] Bulat A, Tzimiropoulos G. How far are we from solving the 2d & 3d face alignment problem?(and a dataset of 230,0003d facial landmarks)[C]//IEEE International Conference on Computer Vision, 2017:1021-1030.
[31] Kingma D P, Ba J. Adam:a method for stochastic optimization[DB/OL]. 2017[2017-01-30]. arXiv:1412.6980.