Unsupervised, Efficient and Semantic Expertise Retrieval

Open Access
Authors
Publication date 2016
Book title WWW'16
Book subtitle proceedings of the 25th International Conference on World Wide Web : May 11-15, 2016, Montreal, Canada
ISBN
  • 9781450341431
Event WWW 2016: The 25th International Conference on World Wide Web
Pages (from-to) 1069-1079
Publisher New York, NY: Association for Computing Machinery
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
We introduce an unsupervised discriminative model for the task of retrieving experts in online document collections. We exclusively employ textual evidence and avoid explicit feature engineering by learning distributed word representations in an unsupervised way. We compare our model to state-of-the-art unsupervised statistical vector space and probabilistic generative approaches. Our proposed log-linear model achieves the retrieval performance levels of state-of-the-art document-centric methods with the low inference cost of so-called profile-centric approaches. It yields a statistically significant improved ranking over vector space and generative models in most cases, matching the performance of supervised methods on various benchmarks. That is, by using solely text we can do as well as methods that work with external evidence and/or relevance feedback. A contrastive analysis of rankings produced by discriminative and generative approaches shows that they have complementary strengths due to the ability of the unsupervised discriminative model to perform semantic matching.
Document type Conference contribution
Language English
Published at https://doi.org/10.1145/2872427.2882974
Downloads
vangysel-unsupervised-2016 (Accepted author manuscript)
Permalink to this page
Back