Journal of Applied Sciences ›› 2025, Vol. 43 ›› Issue (3): 451-462.doi: 10.3969/j.issn.0255-8297.2025.03.007

• Computer Science and Applications • Previous Articles    

Frequency-Domain Multi-feature Fusion for Deepfake Video Detection Based on Key Frames

WANG Jinwei1,2,3, ZHANG Meigui1, ZHANG Jiawei1, LUO Xiangyang3, MA Bin4   

  1. 1. School of Computer Science, School of Cyber Science and Engineering, Nanjing University of Information Science and Technology, Nanjing 210044, Jiangsu, China;
    2. Jiangsu Collaborative Innovation Center of Atmospheric Environment and Equipment Technology, Nanjing University of Information Science and Technology, Nanjing 210044, Jiangsu, China;
    3. State Key Laboratory of Mathematical Engineering and Advanced Computing, Zhengzhou 450001, Henan, China;
    4. Shandong Provincial Key Laboratory of Computer Networks, Qilu University of Technology, Jinan 250353, Shandong, China
  • Received:2022-04-11 Published:2025-06-23

Abstract: To avoid data redundancy and save computing resources, most of the existing Deepfake video detection methods select multiple frames or partial segments of videos as the detection objects. However, this selection strategy compromises the representation ability of the detection objects and limits the performance. Moreover, while the existing algorithms perform well on individual datasets, their performance degrade seriously when detecting across datasets, highlighting the need for improved generalization. To address these challenges, we propose a frequency domain multi-feature fusion algorithm for Deepfake video detection based on key frames. The mean square error in frequency domain is used to extract the key frames as the detection objects. Then the artifact features of the main frame and temporal inconsistency features between the key frames are learned in frequency domain. These features are fused and passed through a fully connected layer to obtain the final detection results. Experimental results show that our algorithm achieves superior performance in cross-dataset detection compared to existing methods, showcasing strong generalization capabilities.

Key words: Deepfake video detection, key frames, frequency domain, multi-feature fusion

CLC Number: