Journal of Applied Sciences ›› 2026, Vol. 44 ›› Issue (3): 503-514.doi: 10.3969/j.issn.0255-8297.2026.03.011

• Artiflcial Intelligence Technology and Applications • Previous Articles    

Multi-modal Rumor Detection Method Fusing Image and Text Features

GAO Guangliang1, LIANG Weichao2, ZHU Tao3, HONG Lei1, XIA Lingling4   

  1. 1. School of National Security, Jiangsu Police Institute, Nanjing 210031, Jiangsu, China;
    2. School of Computer Science and Engineering, Guangxi Normal University, Guilin 541004, Guangxi, China;
    3. School of Digital and Intelligent Police Technology, Jiangsu Police Institute, Nanjing 210031, Jiangsu, China;
    4. School of Cyberspace Security, Jiangsu Police Institute, Nanjing 210031, Jiangsu, China
  • Received:2025-12-17 Published:2026-06-23

Abstract: To further improve the effectiveness and stability of rumor detection, a multimodal rumor detection method was proposed that integrated image and text features with adaptive noise resistance and semantic restoration. First, a three-stage processing pipeline consisting of pretrained encoding, dynamic pooling, and multi-head enhancement was designed to encode rumor-related texts and comments into semantic vectors. Then,two parallel modules were constructed: one for visual feature extraction and the other for optical character feature extraction. These modules encoded rumor images and explicit text within them into complementary enhanced image vectors. Finally, adaptive gating and dynamic cross-attention mechanisms were used to filter noise and enhance semantics,achieving local alignment and global integration of image and text information. Experimental results show that, compared with baseline algorithms, the proposed method can effectively capture deep correlations between image and text information and improve credibility and practicality of rumor detection results.

Key words: public security intelligence, rumor detection, image-text semantic matching, multimodal fusion

CLC Number: