Multi-Modal Knowledge Graph Transformer Framework for Multi-Modal Entity Alignment

descriptionPublicationkeyboard_double_arrow_right Article , Preprint , Conference object 01 Jan 2023Embargo end date: 01 Jan 2023Publisher:Association for Computational Linguistics (ACL)Journal:Findings of the Association for Computational Linguistics: EMNLP 2023

Authors: Qian Li 0033; Cheng Ji 0001; Shu Guo; Zhaoji Liang; Lihong Wang; Jianxin Li 0002;

doi: 10.18653/v1/2023.findings-emnlp.70 , 10.48550/arxiv.2310.06365

arXiv: 2310.06365

Multi-Modal Knowledge Graph Transformer Framework for Multi-Modal Entity Alignment

- Summary
- Subjects
- Metrics

Abstract

Multi-Modal Entity Alignment (MMEA) is a critical task that aims to identify equivalent entity pairs across multi-modal knowledge graphs (MMKGs). However, this task faces challenges due to the presence of different types of information, including neighboring entities, multi-modal attributes, and entity types. Directly incorporating the above information (e.g., concatenation or attention) can lead to an unaligned information space. To address these challenges, we propose a novel MMEA transformer, called MoAlign, that hierarchically introduces neighbor features, multi-modal attributes, and entity types to enhance the alignment task. Taking advantage of the transformer's ability to better integrate multiple information, we design a hierarchical modifiable self-attention block in a transformer encoder to preserve the unique semantics of different information. Furthermore, we design two entity-type prefix injection methods to integrate entity-type information using type prefixes, which help to restrict the global information of entities not present in the MMKGs. Our extensive experiments on benchmark datasets demonstrate that our approach outperforms strong competitors and achieves excellent entity alignment performance.

Related Organizations

Beihua University
China (People's Republic of)
University of Chinese Academy of Sciences
China (People's Republic of)
Beihang University
China (People's Republic of)
National Computer Network Emergency Response Technical Team/Coordination Center of Chinar
China (People's Republic of)

Keywords

FOS: Computer and information sciences, Computer Science - Computation and Language, Computation and Language (cs.CL)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	14
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Top 10%
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Top 10%
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Top 10%

Found an issue? Give us feedback

14

Top 10%

Green