A framework for unsupervised spam detection in social networking sites

M. Bosma; E. Meij; W. Weerkamp

doi:https://doi.org/10.1007/978-3-642-28997-2_31

A framework for unsupervised spam detection in social networking sites

Authors	M. Bosma E. Meij W. Weerkamp
Publication date	2012
Host editors	R. Baeza-Yates A.P. de Vries H. Zaragoza B.B. Cambazoglu V. Murdock R. Lempel F. Silvestri
Book title	Advances in Information Retrieval
Book subtitle	34th European Conference on IR Research, ECIR 2012, Barcelona, Spain, April 1-5, 2012: proceedings
ISBN	9783642289965
ISBN (electronic)	9783642289972
Series	Lecture Notes in Computer Science
Event	34th European Conference on Information Retrieval (ECIR 2012)
Pages (from-to)	364-375
Publisher	Heidelberg: Springer
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	Social networking sites offer users the option to submit user spam reports for a given message, indicating this message is inappropriate. In this paper we present a framework that uses these user spam reports for spam detection. The framework is based on the HITS web link analysis framework and is instantiated in three models. The models subsequently introduce propagation between messages reported by the same user, messages authored by the same user, and messages with similar content. Each of the models can also be converted to a simple semi-supervised scheme. We test our models on data from a popular social network and compare the models to two baselines, based on message content and raw report counts. We find that our models outperform both baselines and that each of the additions (reporters, authors, and similar messages) further improves the performance of the framework.
Document type	Conference contribution
Language	English
Published at	https://doi.org/10.1007/978-3-642-28997-2_31 (Final published version)
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

A framework for unsupervised spam detection in social networking sites