ViTOR: Learning to rank webpages based on visual features

Open Access
Authors
Publication date 2019
Book title The Web Conference 2019
Book subtitle proceedings of the World Wide Web Conference WWW 2019 : May 13-17, 2019, San Francisco, CA, USA
ISBN (electronic)
  • 9781450366748
Event 2019 World Wide Web Conference, WWW 2019
Pages (from-to) 3279-3285
Number of pages 7
Publisher New York: Association for Computing Machinery
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
  • Faculty of Science (FNWI)
Abstract

The visual appearance of a webpage carries valuable information about the page's quality and can be used to improve the performance of learning to rank (LTR). We introduce the Visual learning TO Rank (ViTOR) model that integrates state-of-the-art visual features extraction methods: (i) transfer learning from a pre-trained image classification model, and (ii) synthetic saliency heat maps generated from webpage snapshots. Since there is currently no public dataset for the task of LTR with visual features, we also introduce and release the ViTOR dataset, containing visually rich and diverse webpages. The ViTOR dataset consists of visual snapshots, non-visual features and relevance judgments for ClueWeb12 webpages and TREC Web Track queries. We experiment with the proposed ViTOR model on the ViTOR dataset and show that it significantly improves the performance of LTR with visual features.

Document type Conference contribution
Note © 2019 International World Wide Web Conference Committee.
Language English
Related dataset ViTOR: Learning to Rank Webpages Based on Visual Features
Published at https://doi.org/10.1145/3308558.3313419
Other links https://www.scopus.com/pages/publications/85066886242
Downloads
p3279-akker (Final published version)
Permalink to this page
Back