Deconstructing Multimodality: Visual Properties and Visual Context in Human Semantic Processing

C. Davis; L. Bulat; A. Vero; E. Shutova

doi:https://doi.org/10.18653/v1/S19-1013

Deconstructing Multimodality: Visual Properties and Visual Context in Human Semantic Processing

Authors	C. Davis L. Bulat A. Vero E. Shutova
Publication date	2019
Host editors	R. Mihalcea E. Shutova L.-W. Ku K. Evang S. Poria
Book title	Lexical and Computational Semantics (*SEM)
Book subtitle	NAACL HLT 2019 : proceedings of the Eighth Conference : June 6-7, 2019, Minneapolis
ISBN (electronic)	9781948087933
Event	The Eighth Joint Conference on Lexical and Computational Semantics - *SEM 2019
Pages (from-to)	118-124
Publisher	Stroudsburg, PA: The Association for Computational Linguistics
Organisations	Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract	Multimodal semantic models that extend linguistic representations with additional perceptual input have proved successful in a range of natural language processing (NLP) tasks. Recent research has successfully used neural methods to automatically create visual representations for words. However, these works have extracted visual features from complete images, and have not examined how different kinds of visual information impact performance. In contrast, we construct multimodal models that differentiate between internal visual properties of the objects and their external visual context. We evaluate the models on the task of decoding brain activity associated with the meanings of nouns, demonstrating their advantage over those based on complete images.
Document type	Conference contribution
Language	English
Published at	https://doi.org/10.18653/v1/S19-1013 (Final published version)
Downloads	S19-1013 (Final published version)
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

Deconstructing Multimodality: Visual Properties and Visual Context in Human Semantic Processing