AnaLog: Testing Analytical and Deductive Logic Learnability in Language Models

S. Ryb; M. Giulianelli; A. Sinclair; R. Fernández

doi:https://doi.org/10.18653/v1/2022.starsem-1.5

AnaLog: Testing Analytical and Deductive Logic Learnability in Language Models

Authors	S. Ryb M. Giulianelli A. Sinclair R. Fernández
Publication date	2022
Host editors	V. Nastase E. Pavlick M.T. Pilehvar J. Camacho-Collados A. Raganato
Book title	The 11th Joint Conference on Lexical and Computational Semantics
Book subtitle	*SEM 2022 : proceedings of the conference : July 14-15, 2022
ISBN (electronic)	9781955917988
Event	The 11th Joint Conference on Lexical and Computational Semantics
Pages (from-to)	55-68
Number of pages	14
Publisher	Stroudsburg, PA: Association for Computational Linguistics
Organisations	Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract	We investigate the extent to which pre-trained language models acquire analytical and deductive logical reasoning capabilities as a side effect of learning word prediction. We present AnaLog, a natural language inference task designed to probe models for these capabilities, controlling for different invalid heuristics the models may adopt instead of learning the desired generalisations. We test four languagemodels on AnaLog, finding that they have all learned, to a different extent, to encode information that is predictive of entailment beyond shallow heuristics such as lexical overlap and grammaticality. We closely analyse the best performing language model and show that while it performs more consistently than other language models across logical connectives and reasoning domains, it still is sensitive to lexical and syntactic variations in the realisation of logical statements.
Document type	Conference contribution
Language	English
Related dataset	AnaLog
Published at	https://doi.org/10.18653/v1/2022.starsem-1.5 (Final published version)
Downloads	2022.starsem-1.5 (Final published version)
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

AnaLog: Testing Analytical and Deductive Logic Learnability in Language Models