Learning Topic-Sensitive Word Representations

Open Access
Authors
Publication date 2017
Host editors
  • R. Barzilay
  • M.-Y. Kan
Book title The 55th Annual Meeting of the Association for Computational Linguistics
Book subtitle proceedings of the Conference : July 30-August 4, 2017, Vancouver, Canada
ISBN
  • 9781945626760
Event Annual Meeting of the Association for Computational Linguistics
Volume | Issue number 2
Pages (from-to) 441-447
Publisher Stroudsburg, PA: Association for Computational Linguistics
Organisations
  • Faculty of Science (FNWI)
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
Distributed word representations are widely used for modeling words in NLP tasks. Most of the existing models generate one representation per word and do not consider different meanings of a word. We present two approaches to learn multiple topic-sensitive representations per word by using Hierarchical Dirichlet Process. We observe that by modeling topics and integrating topic distributions for each document we obtain representations that are able to distinguish between different meanings of a given word. Our models yield statistically significant improvements for the lexical substitution task indicating that commonly used single word representations, even when combined with contextual information, are insufficient for this task.
Document type Conference contribution
Language English
Published at https://doi.org/10.18653/v1/P17-2070
Other links http://aclweb.org/anthology/attachments/P/P17/P17-2070.Presentation.pdf
Downloads
P17-2070 (Final published version)
Permalink to this page
Back