Prompting as Probing: Using Language Models for Knowledge Base Construction

Open Access
Authors
  • D. Alivanistos
  • S. Báez Santamaría
  • M Cochez
  • J.-C. Kalo ORCID logo
Publication date 2022
Host editors
  • S. Singhania
  • T.-P. Nguyen
  • S. Razniewski
Book title Proceedings of the Semantic Web Challenge on Knowledge Base Construction from Pre-trained Language Models 2022
Book subtitle co-located with the 21st International Semantic Web Conference (ISWC2022) : virtual event, Hanghzou, China, October 2022
Series CEUR Workshop Proceedings
Event 2022 Semantic Web Challenge on Knowledge Base Construction from Pre-Trained Language Models, LM-KBC 2022
Pages (from-to) 11-34
Number of pages 24
Publisher Aachen: CEUR-WS
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract

Language Models (LMs) have proven to be useful in various downstream applications, such as sum-marisation, translation, question answering and text classification. LMs are becoming increasingly important tools in Artificial Intelligence, because of the vast quantity of information they can store. In this work, we present ProP (Prompting as Probing), which utilizes GPT-3, a large Language Model originally proposed by OpenAI in 2020, to perform the task of Knowledge Base Construction (KBC). ProP implements a multi-step approach that combines a variety of prompting techniques to achieve this. Our results show that manual prompt curation is essential, that the LM must be encouraged to give answer sets of variable lengths, in particular including empty answer sets, that true/false questions are a useful device to increase precision on suggestions generated by the LM, that the size of the LM is a crucial factor, and that a dictionary of entity aliases improves the LM score. Our evaluation study indicates that these proposed techniques can substantially enhance the quality of the final predictions: ProP won track 2 of the LM-KBC competition, outperforming the baseline by 36.4 percentage points. Our implementation is available on https://github.com/HEmile/iswc-challenge.

Document type Conference contribution
Language English
Published at https://ceur-ws.org/Vol-3274/paper2.pdf
Other links https://ceur-ws.org/Vol-3274/ https://github.com/HEmile/iswc-challenge https://www.scopus.com/pages/publications/85142885403
Downloads
paper2-5 (Final published version)
Permalink to this page
Back