Inseq: An interpretability toolkit for sequence generation models

G. Sarti; N. Feldhus; L. Sickert; O. van der Wal

doi:https://doi.org/10.18653/v1/2023.acl-demo.40

Inseq: An interpretability toolkit for sequence generation models

Authors	G. Sarti N. Feldhus L. Sickert O. van der Wal
Publication date	2023
Host editors	D. Bollegala R. Hang A. Ritter
Book title	The 61st Conference of the Association for Computational Linguistics: System Demonstrations
Book subtitle	ACL-DEMO 2023 : Proceedings of the System Demonstrations : July 10-12, 2023
ISBN (electronic)	9781959429708
Event	61st Annual Meeting of the Association for Computational Linguistics, ACL-DEMO 2023
Pages (from-to)	421-435
Number of pages	15
Publisher	Stroudsburg, PA: Association for Computational Linguistics
Organisations	Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract	Past work in natural language processing interpretability focused mainly on popular classification tasks while largely overlooking generation settings, partly due to a lack of dedicated tools. In this work, we introduce Inseq1, a Python library to democratize access to interpretability analyses of sequence generation models. Inseq enables intuitive and optimized extraction of models internal information and feature importance scores for popular decoderonly and encoder-decoder Transformers architectures. We showcase its potential by adopting it to highlight gender biases in machine translation models and locate factual knowledge inside GPT-2. Thanks to its extensible interface supporting cutting-edge techniques such as contrastive feature attribution, Inseq can drive future advances in explainable natural language generation, centralizing good practices and enabling fair and reproducible model evaluations.
Document type	Conference contribution
Language	English
Published at	https://doi.org/10.18653/v1/2023.acl-demo.40 (Final published version)
Other links	https://www.scopus.com/pages/publications/85165717900
Downloads	2023.acl-demo.40-1 (Final published version)
Supplementary materials	2023.acl-demo.40
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

Inseq: An interpretability toolkit for sequence generation models