Database Cracking: Fancy Scan, not Poor Man’s Sort!

H. Pirk; E. Petraki; S. Idreos; S. Manegold; M. Kersten

doi:https://doi.org/10.1145/2619228.2619232

Database Cracking: Fancy Scan, not Poor Man’s Sort!

Authors	H. Pirk E. Petraki S. Idreos S. Manegold M. Kersten
Publication date	2014
Host editors	A. Kemper P. Pandis
Book title	DAMON: Tenth International Workshop on Data Management on New Hardware (DaMoN 2014): Snowbird, UT, USA, June 23, 2014: in conjunction with the ACM SIGMOD/PODS Conference
ISBN	9781450329712
Event	10th International Workshop on Data Management on New Hardware
Pages (from-to)	4
Publisher	Piscataway, NJ: ACM
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	Database Cracking is an appealing approach to adaptive indexing: on every range-selection query, the data is partitioned using the supplied predicates as pivots. The core of database cracking is, thus, pivoted partitioning. While pivoted partitioning, like scanning, requires a single pass through the data it tends to have much higher costs due to lower CPU efficiency. In this paper, we conduct an in-depth study of the reasons for the low CPU efficiency of pivoted partitioning. Based on the findings, we develop an optimized version with significantly higher (single-threaded) CPU efficiency. We also develop a number of multi-threaded implementations that are effectively bound by memory bandwidth. Combining all of these optimizations we achieve an implementation that has costs close to or better than an ordinary scan on a variety of systems ranging from low-end (cheaper than $300) desktop machines to high-end (above $60,000) servers.
Document type	Conference contribution
Language	English
Published at	https://doi.org/10.1145/2619228.2619232
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

Database Cracking: Fancy Scan, not Poor Man’s Sort!