Search results

    Filter results

  • Full text

  • Document type

  • Publication year

  • Organisation

Results: 11
Number of items: 11
  • Open Access
    Mueller, A., Geiger, A., Wiegreffe, S., Arad, D., Arcuschin, I., Belfki, A., Chan, Y. S., Fiotto-Kaufman, J., Haklay, T., Hanna, M., Huang, J., Gupta, R., Nikankin, Y., Orgad, H., Prakash, N., Reusch, A., Sankaranarayanan, A., Shao, S., Stolfo, A., ... Belinkov, Y. (2025). MIB: A Mechanistic Interpretability Benchmark. Proceedings of Machine Learning Research, 267, 45069-45108. https://proceedings.mlr.press/v267/mueller25a.html
  • Open Access
    Hanna, M., Piotrowski, M., Lindsey, J., & Ameisen, E. (2025). Circuit-Tracer: A New Library for Finding Feature Circuits. In Y. Belinkov, A. Mueller, N. Kim, H. Mohebbi, H. Chen, D. Arad, & G. Sarti (Eds.), The 8th BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP: BlackboxNLP 2025 : proceedings of the workshop: November 9, 2025 (pp. 239-249). Association for Computational Linguistics. https://doi.org/10.18653/v1/2025.blackboxnlp-1.14
  • Open Access
    Bavaresco, A., Bernardi, R., Bertolazzi, L., Elliott, D., Fernández, R., Gatt, A., Ghaleb, E., Giulianelli, M., Hanna, M., Koller, A., Martins, A. F. T., Mondorf, P., Neplenbroek, V., Pezzelle, S., Plank, B., Schlangen, D., Suglia, A., Surikuchi, A. K., Takmaz, E., & Testoni, A. (2025). LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks. In W. Che, J. Nabende, E. Shutova, & M. T. Pilehvar (Eds.), The 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025) : proceedings of the conference: ACL 2025 : July 27-August 1, 2025 (Vol. 2, pp. 238–255). Association for Computational Linguistics. https://doi.org/10.18653/v1/2025.acl-short.20
  • Open Access
    Hanna, M., & Mueller, A. (2025). Incremental Sentence Processing Mechanisms in Autoregressive Transformer Language Models. In L. Chiruzzo, A. Ritter, & L. Wang (Eds.), Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: proceedings of the conference: NAACL 2025 : April 29-May 4, 2025 (Vol. 1, pp. 3181–3203). Association for Computational Linguistics. https://doi.org/10.48550/arXiv.2412.05353, https://doi.org/10.18653/v1/2025.naacl-long.164
  • Open Access
    Mohebbi, H., Jumelet, J., Hanna, M., Alishahi, A., & Zuidema, W. (2024). Transformer-specific Interpretability. In M. Mesgar, & S. Loáiciga (Eds.), The 18th Conference of the European Chapter of the Association for Computational Linguistics : Proceedings of Tutorial Abstracts: EACL : March 21, 2024 (pp. 21-26). Association for Computational Linguistics. https://doi.org/10.18653/v1/2024.eacl-tutorials.4
  • Open Access
    Wildenburg, F., Hanna, M., & Pezzelle, S. (2024). Do Pre-Trained Language Models Detect and Understand Semantic Underspecification? Ask the DUST!. In L.-W. Ku, A. Martins, & V. Srikumar (Eds.), The 62nd Annual Meeting of the Association for Computational Linguistics : Findings of the Association for Computational Linguistics: ACL 2024: ACL 2024 : August 11-16, 2024 (pp. 9598-9613). Association for Computational Linguistics. https://doi.org/10.48550/arXiv.2402.12486, https://doi.org/10.18653/v1/2024.findings-acl.572
  • Open Access
    Hanna, M., Zamparelli, R., & Mareček, D. (2023). The Functional Relevance of Probed Information: A Case Study. In A. Vlachos, & I. Augenstein (Eds.), The 17th Conference of the European Chapter of the Association for Computational Linguistics: EACL 2023 : proceedings of the conference : May 2-6, 2023 (pp. 835-848). Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.eacl-main.58
  • Open Access
    Hanna, M., Liu, O., & Variengien, A. (2023). How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language model. In A. Oh, T. Naumann, A. Globerson, K. Saenko, M. Hardt, & S. Levine (Eds.), 37th Conference on Neural Information Processing Systems (NeurIPS 2023): 10-16 December 2023, New Orleans, Louisana, USA (Advances in Neural Information Processing Systems; Vol. 36). Neural Information Processing Systems Foundation. https://papers.nips.cc/paper_files/paper/2023/hash/efbba7719cc5172d175240f24be11280-Abstract-Conference.html
  • Open Access
    Chintam, A., Beloch, R., Zuidema, W., Hanna, M., & van der Wal, O. (2023). Identifying and Adapting Transformer-Components Responsible for Gender Bias in an English Language Model. In Y. Belinkov, S. Hao, J. Jumelet, N. Kim, A. McCarthy, & H. Mohebbi (Eds.), BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP: Proceedings of the Sixth Workshop : EMNLP 2023 : December 7, 2023 (pp. 379-394). The Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.blackboxnlp-1.29
  • Open Access
    Jumelet, J., Hanna, M., de Heer Kloots, M., Langedijk, A., Pouw, C., & van der Wal, O. (2023). ChapGTP, ILLC’s Attempt at Raising a BabyLM: Improving Data Efficiency by Automatic Task Formation. In A. Warstadt, A. Mueller, L. Choshen, E. Wilcox, C. Zhuang, J. Ciro, R. Mosquera, B. Paranjabe, A. Williams, T. Linzen, & R. Cotterell (Eds.), Findings of the BabyLM Challenge: Sample-efficient pretraining on developmentally plausible corpora (pp. 74-85). Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.conll-babylm.6
Page 1 of 2