Promoting free dialog video corpora: the IFADV corpus example

R.J.J.H. van Son; W. Wesseling; E. Sanders; H. van den Heuvel

doi:https://doi.org/10.1007/978-3-642-04793-0_2

Promoting free dialog video corpora: the IFADV corpus example

Authors	R.J.J.H. van Son W. Wesseling E. Sanders H. van den Heuvel
Publication date	2009
Host editors	M. Kipp J.C. Martin P. Paggio D. Heylen
Book title	Multimodal corpora: from models of natural interaction to systems and applications
ISBN	9783642047923
Series	Lecture notes in computer science, 5509
Pages (from-to)	18-37
Publisher	Berlin: Springer
Organisations	Faculty of Humanities (FGw) - Amsterdam Institute for Humanities Research (AIHR) - Amsterdam Center for Language and Communication (ACLC)
Abstract	Research into spoken language has become more visual over the years. Both fundamental and applied research have progressively included gestures, gaze, and facial expression. Corpora of multi-modal conversational speech are rare and frequently difficult to use due to privacy and copyright restrictions. In contrast, Free-and-Libre corpora would allow anyone to add incremental annotations and improvement, distributing the cost of construction and maintenance. A freely available annotated corpus is presented with high quality video recordings of face-to-face conversational speech. An effort has been made to remove copyright and use restrictions. Annotations have been processed to RDBMS tables that allow SQL queries and direct connections to statistical software. A few simple examples are presented to illustrate the use of a databases of annotated speech. From our experiences we would like to advocate the formulation of "best practises" for both legal handling and database storage of recordings and annotations.
Document type	Chapter
Published at	https://doi.org/10.1007/978-3-642-04793-0_2 (Final published version)
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

Promoting free dialog video corpora: the IFADV corpus example