Isolating GPU Architectural Features Using Parallelism-Aware Microbenchmarks

R. van Stigt; S.N. Swatman; A.-L. Varbanescu

doi:https://doi.org/10.1145/3489525.3511673

Isolating GPU Architectural Features Using Parallelism-Aware Microbenchmarks

Authors	R. van Stigt S.N. Swatman A.-L. Varbanescu
Publication date	2022
Book title	ICPE '22
Book subtitle	proceedings of the 2022 ACM/SPEC International Conference on Performance Engineering : April 9-13, 2022, Bejing, China
ISBN (electronic)	9781450391436
Event	13th ACM/SPEC International Conference on Performance Engineering
Pages (from-to)	77-88
Publisher	New York, NY: The Association for Computing Machinery
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	GPUs develop at a rapid pace, with new architectures emerging every 12 to 18 months. Every new GPU architecture introduces new features, expecting to improve on previous generations. However, the impact of these changes on the performance of GPGPU applications may not be directly apparent; it is often unclear to developers how exactly these features will affect the performance of their code. In this paper we propose a suite of microbenchmarks to uncover the performance of novel GPU hardware features in isolation. We target features in both the memory system and the arithmetic cores. We further ensure, by design, that our microbenchmarks capture the massively parallel nature of the GPUs, while providing fine-grained timing information at the level of individual compute units. Using this benchmarking suite, we study the differences between three of the most recent NVIDIA architectures: Pascal, Turing, and Ampere. We find that the architecture differences can have a meaningful impact on both synthetic and more realistic applications. This impact is visible both in terms of outright performance, but also affects the choice of execution parameters for realistic applications. We conclude that microbenchmarking, adapted to massive GPU parallelism, can expose differences between GPU generations, and discuss how it can be adapted for future architectures.
Document type	Conference contribution
Language	English
Published at	https://doi.org/10.1145/3489525.3511673
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

Isolating GPU Architectural Features Using Parallelism-Aware Microbenchmarks