What Is Learned in Knowledge Graph Embeddings?

descriptionPublicationkeyboard_double_arrow_right Part of book or chapter of book , Article , Preprint , Conference object 01 Jan 2022Embargo end date: 01 Jan 2021 English Publisher:Springer International Publishing

Authors: Michael R. Douglas; Michael Simkin; Omri Ben-Eliezer; Tianqi Wu; Peter Chin 0001; Trung V. Dang; Andrew Wood;

doi: 10.1007/978-3-030-93413-2_49 , 10.48550/arxiv.2110.09978

arXiv: 2110.09978

What Is Learned in Knowledge Graph Embeddings?

- Summary
- Subjects
- Metrics

Abstract

A knowledge graph (KG) is a data structure which represents entities and relations as the vertices and edges of a directed graph with edge types. KGs are an important primitive in modern machine learning and artificial intelligence. Embedding-based models, such as the seminal TransE [Bordes et al., 2013] and the recent PairRE [Chao et al., 2020] are among the most popular and successful approaches for representing KGs and inferring missing edges (link completion). Their relative success is often credited in the literature to their ability to learn logical rules between the relations. In this work, we investigate whether learning rules between relations is indeed what drives the performance of embedding-based methods. We define motif learning and two alternative mechanisms, network learning (based only on the connectivity of the KG, ignoring the relation types), and unstructured statistical learning (ignoring the connectivity of the graph). Using experiments on synthetic KGs, we show that KG models can learn motifs and how this ability is degraded by non-motif (noise) edges. We propose tests to distinguish the contributions of the three mechanisms to performance, and apply them to popular KG benchmarks. We also discuss an issue with the standard performance testing protocol and suggest an improvement. To appear in the proceedings of Complex Networks 2021.

16 pages

Related Organizations

Massachusetts Institute of Technology
United States
Harvard University
United States
Boston University
United States
Clarke University
United States
Stony Brook University
United States

View all View all

Keywords

FOS: Computer and information sciences, Artificial Intelligence (cs.AI), I.2.4, Computer Science - Artificial Intelligence

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	2
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

2

Average

Green

Fields of Science (4) View all

natural sciences

Fields of Science

natural sciences

View all