Inseq: An interpretability toolkit for sequence generation models

Open Access
Authors
Publication date 2023
Host editors
  • D. Bollegala
  • R. Hang
  • A. Ritter
Book title The 61st Conference of the Association for Computational Linguistics: System Demonstrations
Book subtitle ACL-DEMO 2023 : Proceedings of the System Demonstrations : July 10-12, 2023
ISBN (electronic)
  • 9781959429708
Event 61st Annual Meeting of the Association for Computational Linguistics, ACL-DEMO 2023
Pages (from-to) 421-435
Number of pages 15
Publisher Stroudsburg, PA: Association for Computational Linguistics
Organisations
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract

Past work in natural language processing interpretability focused mainly on popular classification tasks while largely overlooking generation settings, partly due to a lack of dedicated tools. In this work, we introduce Inseq1, a Python library to democratize access to interpretability analyses of sequence generation models. Inseq enables intuitive and optimized extraction of models internal information and feature importance scores for popular decoderonly and encoder-decoder Transformers architectures. We showcase its potential by adopting it to highlight gender biases in machine translation models and locate factual knowledge inside GPT-2. Thanks to its extensible interface supporting cutting-edge techniques such as contrastive feature attribution, Inseq can drive future advances in explainable natural language generation, centralizing good practices and enabling fair and reproducible model evaluations.

Document type Conference contribution
Language English
Published at https://doi.org/10.18653/v1/2023.acl-demo.40
Other links https://www.scopus.com/pages/publications/85165717900
Downloads
2023.acl-demo.40-1 (Final published version)
Supplementary materials
Permalink to this page
Back