Verifying Peephole Rewriting in SSA Compiler IRs

Open Access
Authors
  • Tobias Grosser
Publication date 09-2024
Host editors
  • Yves Bertot
  • Temur Kutsia
  • Michael Norrish
Book title 15th International Conference on Interactive Theorem Proving
Book subtitle ITP 2024, September 9–14, 2024, Tbilisi, Georgia
ISBN (electronic)
  • 9783959773379
Series Leibniz International Proceedings in Informatics
Event 15th International Conference on Interactive Theorem Proving, ITP 2024
Article number 9
Number of pages 20
Publisher Saarbrücken/Wadern: Schloss Dagstuhl - Leibniz-Zentrum für Informatik
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract

There is an increasing need for domain-specific reasoning in modern compilers. This has fueled the use of tailored intermediate representations (IRs) based on static single assignment (SSA), like in the MLIR compiler framework. Interactive theorem provers (ITPs) provide strong guarantees for the end-to-end verification of compilers (e.g., CompCert). However, modern compilers and their IRs evolve at a rate that makes proof engineering alongside them prohibitively expensive. Nevertheless, well-scoped push-button automated verification tools such as the Alive peephole verifier for LLVM-IR gained recognition in domains where SMT solvers offer efficient (semi) decision procedures. In this paper, we aim to combine the convenience of automation with the versatility of ITPs for verifying peephole rewrites across domain-specific IRs. We formalize a core calculus for SSA-based IRs that is generic over the IR and covers so-called regions (nested scoping used by many domain-specific IRs in the MLIR ecosystem). Our mechanization in the Lean proof assistant provides a user-friendly frontend for translating MLIR syntax into our calculus. We provide scaffolding for defining and verifying peephole rewrites, offering tactics to eliminate the abstraction overhead of our SSA calculus. We prove correctness theorems about peephole rewriting, as well as two classical program transformations. To evaluate our framework, we consider three use cases from the MLIR ecosystem that cover different levels of abstractions: (1) bitvector rewrites from LLVM, (2) structured control flow, and (3) fully homomorphic encryption. We envision that our mechanization provides a foundation for formally verified rewrites on new domain-specific IRs.

Document type Conference contribution
Language English
Published at https://doi.org/10.4230/LIPIcs.ITP.2024.9
Other links https://github.com/opencompl/lean-mlir/tree/ITP24 https://www.scopus.com/pages/publications/85204034217
Downloads
LIPIcs.ITP.2024.9 (Final published version)
Permalink to this page
Back