Cluster-Driven Navigation of the Query Space

Authors
Publication date 05-2016
Journal IEEE Transactions on Knowledge and Data Engineering
Volume | Issue number 28 | 5
Pages (from-to) 1118-1131
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
How can users who know neither programming nor statistics explore large databases? We present a novel interface, designed to guide explorers through their data: Blaeu. Blaeu is a database front-end, “boosted” with unsupervised learning primitives. Thanks to these primitives, it can summarize and recommend queries. Our first contribution is Blaeu's interaction model. With Blaeu, users explore the data through data maps. A data map is an interactive set of clusters, which users navigate with zooms and projections. Our second contribution is Blaeu's engine. We present three mapping algorithms, for three different settings. The first algorithm deals with small to medium databases, the second one targets high-dimensional spaces, and the last one focuses on speed and interaction. We then present an optimization strategy based on sampling. Our experiments reveal that Blaeu can cluster millions of tuples with hundreds of columns in a few seconds on commodity hardware.
Document type Article
Language English
Published at https://doi.org/10.1109/TKDE.2016.2515590
Other links https://ivi.fnwi.uva.nl/isis/publications/2016/SellamPVLDB2016c