Journal of Applied Sciences ›› 2025, Vol. 43 ›› Issue (2): 234-244.doi: 10.3969/j.issn.0255-8297.2025.02.004

• Communication Engineering • Previous Articles    

Video Anomaly Detection Method Based on Multi-scale Feature Fusion and Attention Mechanism

WU Xiang, XIAO Jian, JI Genlin   

  1. School of Computer and Electronic Information/School of Artificial Intelligence, Nanjing Normal University, Nanjing 210023, Jiangsu, China
  • Received:2023-07-19 Published:2025-04-03

Abstract: Motion objects in video frames often exhibit diverse scales over time, which poses a challenge for video anomaly detection. Although traditional generative adversarial networks (GANs) have achieved some success in video anomaly detection tasks, their performance is limited due to the use of a single-scale feature extraction that fails to capture features of objects at different scales. To address this issue, this paper proposes a video anomaly detection method based on a GAN structure that incorporates multi-scale feature fusion and attention mechanisms. Specifically, different-sized convolutional kernels are employed to capture features with varying receptive fields, which are then fused to obtain multi-scale feature representations. Additionally, a coordinate attention mechanism is introduced after the transposed convolutional layers of the generator, allowing adaptive allocation of feature map weights to enhance the model’s perception of crucial features.Experimental results on the public datasets UCSD Ped2 and Avenue demonstrate that the proposed method outperforms existing approaches.

Key words: video anomaly detection, deep learning, generative adversarial networks, multi-scale feature fusion, attention mechanism

CLC Number: