Search results
Results: 11
Number of items: 11
-
Mueller, A., Geiger, A., Wiegreffe, S., Arad, D., Arcuschin, I., Belfki, A., Chan, Y. S., Fiotto-Kaufman, J., Haklay, T., Hanna, M., Huang, J., Gupta, R., Nikankin, Y., Orgad, H., Prakash, N., Reusch, A., Sankaranarayanan, A., Shao, S., Stolfo, A., ... Belinkov, Y. (2025). MIB: A Mechanistic Interpretability Benchmark. Proceedings of Machine Learning Research, 267, 45069-45108. https://proceedings.mlr.press/v267/mueller25a.html -
Hanna, M., Piotrowski, M., Lindsey, J., & Ameisen, E. (2025). Circuit-Tracer: A New Library for Finding Feature Circuits. In Y. Belinkov, A. Mueller, N. Kim, H. Mohebbi, H. Chen, D. Arad, & G. Sarti (Eds.), The 8th BlackboxNLP Workshop: Analyzing and Interpreting Neural Networks for NLP: BlackboxNLP 2025 : proceedings of the workshop: November 9, 2025 (pp. 239-249). Association for Computational Linguistics. https://doi.org/10.18653/v1/2025.blackboxnlp-1.14 -
Bavaresco, A., Bernardi, R., Bertolazzi, L., Elliott, D., Fernández, R., Gatt, A., Ghaleb, E., Giulianelli, M., Hanna, M., Koller, A., Martins, A. F. T., Mondorf, P., Neplenbroek, V., Pezzelle, S., Plank, B., Schlangen, D., Suglia, A., Surikuchi, A. K., Takmaz, E., & Testoni, A. (2025). LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks. In W. Che, J. Nabende, E. Shutova, & M. T. Pilehvar (Eds.), The 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025) : proceedings of the conference: ACL 2025 : July 27-August 1, 2025 (Vol. 2, pp. 238–255). Association for Computational Linguistics. https://doi.org/10.18653/v1/2025.acl-short.20 -
Hanna, M., & Mueller, A. (2025). Incremental Sentence Processing Mechanisms in Autoregressive Transformer Language Models. In L. Chiruzzo, A. Ritter, & L. Wang (Eds.), Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: proceedings of the conference: NAACL 2025 : April 29-May 4, 2025 (Vol. 1, pp. 3181–3203). Association for Computational Linguistics. https://doi.org/10.48550/arXiv.2412.05353, https://doi.org/10.18653/v1/2025.naacl-long.164 -
Mohebbi, H., Jumelet, J., Hanna, M., Alishahi, A., & Zuidema, W. (2024). Transformer-specific Interpretability. In M. Mesgar, & S. Loáiciga (Eds.), The 18th Conference of the European Chapter of the Association for Computational Linguistics : Proceedings of Tutorial Abstracts: EACL : March 21, 2024 (pp. 21-26). Association for Computational Linguistics. https://doi.org/10.18653/v1/2024.eacl-tutorials.4 -
Wildenburg, F., Hanna, M., & Pezzelle, S. (2024). Do Pre-Trained Language Models Detect and Understand Semantic Underspecification? Ask the DUST!. In L.-W. Ku, A. Martins, & V. Srikumar (Eds.), The 62nd Annual Meeting of the Association for Computational Linguistics : Findings of the Association for Computational Linguistics: ACL 2024: ACL 2024 : August 11-16, 2024 (pp. 9598-9613). Association for Computational Linguistics. https://doi.org/10.48550/arXiv.2402.12486, https://doi.org/10.18653/v1/2024.findings-acl.572 -
Hanna, M., Zamparelli, R., & Mareček, D. (2023). The Functional Relevance of Probed Information: A Case Study. In A. Vlachos, & I. Augenstein (Eds.), The 17th Conference of the European Chapter of the Association for Computational Linguistics: EACL 2023 : proceedings of the conference : May 2-6, 2023 (pp. 835-848). Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.eacl-main.58 -
Hanna, M., Liu, O., & Variengien, A. (2023). How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language model. In A. Oh, T. Naumann, A. Globerson, K. Saenko, M. Hardt, & S. Levine (Eds.), 37th Conference on Neural Information Processing Systems (NeurIPS 2023): 10-16 December 2023, New Orleans, Louisana, USA (Advances in Neural Information Processing Systems; Vol. 36). Neural Information Processing Systems Foundation. https://papers.nips.cc/paper_files/paper/2023/hash/efbba7719cc5172d175240f24be11280-Abstract-Conference.html -
Chintam, A., Beloch, R., Zuidema, W., Hanna, M., & van der Wal, O. (2023). Identifying and Adapting Transformer-Components Responsible for Gender Bias in an English Language Model. In Y. Belinkov, S. Hao, J. Jumelet, N. Kim, A. McCarthy, & H. Mohebbi (Eds.), BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP: Proceedings of the Sixth Workshop : EMNLP 2023 : December 7, 2023 (pp. 379-394). The Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.blackboxnlp-1.29 -
Jumelet, J., Hanna, M., de Heer Kloots, M., Langedijk, A., Pouw, C., & van der Wal, O. (2023). ChapGTP, ILLC’s Attempt at Raising a BabyLM: Improving Data Efficiency by Automatic Task Formation. In A. Warstadt, A. Mueller, L. Choshen, E. Wilcox, C. Zhuang, J. Ciro, R. Mosquera, B. Paranjabe, A. Williams, T. Linzen, & R. Cotterell (Eds.), Findings of the BabyLM Challenge: Sample-efficient pretraining on developmentally plausible corpora (pp. 74-85). Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.conll-babylm.6
Page 1 of 2