Journal of Applied Sciences ›› 2024, Vol. 42 ›› Issue (6): 1040-1051.doi: 10.3969/j.issn.0255-8297.2024.06.012

• Computer Science and Applications • Previous Articles     Next Articles

A Multimodal Knowledge Graph Entity Alignment Method

LIU Wei1,2, XU Hui1,2, LI Weimin1   

  1. 1. School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China;
    2. School of Artificial Intelligence, Shanghai University, Shanghai 200444, China
  • Received:2022-11-14 Online:2024-11-30 Published:2024-11-30

Abstract: The fusion of multimodal knowledge graph requires addressing the entity alignment problem in knowledge fusion. In multimodal knowledge graph, multimodal attributes can provide key alignment information to improve entity alignment effectiveness. This paper proposes a method for entity alignment in multimodal knowledge graphs based on multimodal attribute embedding and graph attention network. First, the multimodal knowledge graph is divided into subgraphs according to image, text and graph structure information. Text and graph structure information are then extracted by graph attention network, while image information is extracted by visual geometry group (VGG) network. These multimodal attributes are embedded into vector space. Finally, the proposed method integrates the multimodal attributes and the graph structure of the subgraphs for alignment. Experimental results shows that the proposed model significantly improves performance, achieving increases of 10.64% on Hits@1, 5.60% on Hits@10, and 0.226 on MRR compared to four baseline models for entity alignment.

Key words: multimodal knowledge graph, entity alignment, multimodal attribute embedding, graph attention network

CLC Number: