Effective distributed representations for academic expert search

Open Access
Authors
Publication date 2020
Host editors
  • M.K. Chandrasekaran
  • A. de Waard
  • G. Feigenblat
  • D. Freitag
  • T. Ghosal
  • E. Hovy
  • P. Knoth
  • D. Konopnicki
  • P. Mayr
  • R.M. Patton
  • M. Shmueli-Scheuer
Book title First Workshop on Scholarly Document Processing
Book subtitle EMNLP 2020 : proceedings of the workshop : November 19, 2020, Online
ISBN (electronic)
  • 9781952148705
Event 1st Workshop on Scholarly Document Processing
Pages (from-to) 56-71
Number of pages 16
Publisher Stroudsburg, PA: The Association for Computational Linguistics
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
Expert search aims to find and rank experts based on a user's query. In academia, retrieving experts is an efficient way to navigate through a large amount of academic knowledge. Here, we study how different distributed representations of academic papers (i.e. embeddings) impact academic expert retrieval. We use the Microsoft Academic Graph dataset and experiment with different configurations of a document-centric voting model for retrieval. In particular, we explore the impact of the use of contextualized embeddings on search performance. We also present results for paper embeddings that incorporate citation information through retrofitting. Additionally, experiments are conducted using different techniques for assigning author weights based on author order. We observe that using contextual embeddings produced by a transformer model trained for sentence similarity tasks produces the most effective paper representations for document-centric expert retrieval. However, retrofitting the paper embeddings and using elaborate author contribution weighting strategies did not improve retrieval performance.
Document type Conference contribution
Language English
Published at https://doi.org/10.18653/v1/2020.sdp-1.7
Downloads
2020.sdp-1.7 (Final published version)
Permalink to this page
Back