Explainable user clustering in short text streams

Open Access
Authors
  • Y. Zhao
  • S. Liang
  • Z. Ren
  • J. Ma
Publication date 2016
Book title SIGIR'16
Book subtitle the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval: Pisa, Italy , July 17-21, 2016
ISBN (electronic)
  • 9781450340694
Event SIGIR 2016: 39th international ACM SIGIR conference on Research and development in information retrieval
Pages (from-to) 155-164
Number of pages 10
Publisher New York, NY: Association for Computing Machinery
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
User clustering has been studied from different angles: behavior-based, to identify similar browsing or search patterns, and content-based, to identify shared interests. Once user clusters have been found, they can be used for recommendation and personalization. So far, content-based user clustering has mostly focused on static sets of relatively long documents. Given the dynamic nature of social media, there is a need to dynamically cluster users in the context of short text streams. User clustering in this setting is more challenging than in the case of long documents as it is difficult to capture the users' dynamic topic distributions in sparse data settings. To address this problem, we propose a dynamic user clustering topic model (or UCT for short). UCT adaptively tracks changes of each user's time-varying topic distribution based both on the short texts the user posts during a given time period and on the previously estimated distribution. To infer changes, we propose a Gibbs sampling algorithm where a set of word-pairs from each user is constructed for sampling. The clustering results are explainable and human-understandable, in contrast to many other clustering algorithms. For evaluation purposes, we work with a dataset consisting of users and tweets from each user. Experimental results demonstrate the effectiveness of our proposed clustering model compared to state-of-the-art baselines.
Document type Conference contribution
Language English
Published at https://doi.org/10.1145/2911451.2911522
Downloads
zhao-explainable-2016 (Accepted author manuscript)
Permalink to this page
Back