Inductive entity representations from text via link prediction

Open Access
Publication date 2021
Book title The Web Conference 2021
Book subtitle Proceedings of the World Wide Web Conference WWW 2021: April 19-23, 2021, Ljubljana, Slovenia
ISBN (electronic)
  • 9781450383127
Event 2021 World Wide Web Conference, WWW 2021
Pages (from-to) 798-808
Number of pages 11
Publisher New York, NY: Association for Computing Machinery
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract

Knowledge Graphs (KGs) are of vital importance for multiple applications on the web, including information retrieval, recommender systems, and metadata annotation. Regardless of whether they are built manually by domain experts or with automatic pipelines, KGs are often incomplete. To address this problem, a large body of work proposes using machine learning to complete these graphs by predicting new links. Recent work has begun to explore the use of textual descriptions available in knowledge graphs to learn vector representations of entities in order to perform link prediction. However, the extent to which these representations, learned for link prediction, generalize to other tasks is unclear. This is important given the cost of learning such representations. Ideally, we would prefer representations that do not need to be retrained when transferring to a different task, while retaining reasonable performance. Therefore, in this work, we propose a holistic evaluation protocol for entity representations learned via a link prediction objective. We consider the inductive link prediction and entity classification tasks, which involve entities not seen during training. We also consider an information retrieval task for entity-oriented search. We evaluate an architecture based on a pretrained language model that exhibits strong generalization to entities not observed during training, and outperforms related state-of-the-art methods (22% MRR improvement in link prediction on average). We further provide evidence that the learned representations transfer well to other tasks without fine-tuning. In the entity classification task we obtain an average improvement of 16% in accuracy compared with baselines that also employ pretrained models. In the information retrieval task, we obtain significant improvements of up to 8.8% in NDCG@10 for natural language queries.
We thus show that the learned representations are not limited to KG-specific tasks, and have greater generalization properties than evaluated in previous work.
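As context for the reported metric, link prediction is conventionally evaluated with Mean Reciprocal Rank (MRR): for each test triple, the model scores all candidate entities, and the reciprocal of the true entity's 1-based rank is averaged over the test set. A minimal sketch of this metric (the helper names and toy scores are illustrative, not from the paper):

```python
def rank_of_true(scores, true_idx):
    # 1-based rank of the true entity when candidates are
    # ordered by descending model score.
    true_score = scores[true_idx]
    return 1 + sum(1 for s in scores if s > true_score)

def mean_reciprocal_rank(ranks):
    # Average of 1/rank over all test triples.
    return sum(1.0 / r for r in ranks) / len(ranks)

# Toy example: three test triples, each with scores for three candidates,
# where the true entity is at index 0.
ranks = [
    rank_of_true([0.9, 0.2, 0.1], true_idx=0),  # rank 1
    rank_of_true([0.4, 0.7, 0.3], true_idx=0),  # rank 2
    rank_of_true([0.1, 0.5, 0.3], true_idx=0),  # rank 3
]
print(mean_reciprocal_rank(ranks))  # (1 + 1/2 + 1/3) / 3 ≈ 0.611
```

Higher MRR is better, with 1.0 meaning the true entity is always ranked first; a 22% relative improvement in MRR therefore reflects the correct entity appearing substantially higher in the ranking on average.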

Document type Conference contribution
Language English
Related dataset Inductive WN18RR and FB15k-237
Published at https://doi.org/10.1145/3442381.3450141
Other links https://www.scopus.com/pages/publications/85107914850