Investigating the global semantic impact of speech recognition error on spoken content collections

Authors
Publication date 2009
Host editors
  • M. Boughanem
  • C. Berrut
  • J. Mothe
  • C. Soule-Dupuy
Book title Advances in Information Retrieval
Book subtitle 31th European Conference on IR Research, ECIR 2009, Toulouse, France, April 6-9, 2009 : proceedings
ISBN
  • 9783642009570
ISBN (electronic)
  • 9783642009587
Series Lecture Notes in Computer Science
Event 31th European Conference on IR Research (ECIR 2009), Toulouse, France
Pages (from-to) 755-760
Publisher Berlin: Springer
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
Errors in speech recognition transcripts have a negative impact on effectiveness of content-based speech retrieval and present a particular challenge for collections containing conversational spoken content. We propose a Global Semantic Distortion (GSD) metric that measures the collection-wide impact of speech recognition error on spoken content retrieval in a query-independent manner. We deploy our metric to examine the effects of speech recognition substitution errors. First, we investigate frequent substitutions, cases in which the recognizer habitually mis-transcribes one word as another. Although habitual mistakes have a large global impact, the long tail of rare substitutions has a more damaging effect. Second, we investigate semantically similar substitutions, cases in which the word spoken and the word recognized do not diverge radically in meaning. Similar substitutions are shown to have slightly less global impact than semantically dissimilar substitutions.
Document type Conference contribution
Language English
Published at https://doi.org/10.1007/978-3-642-00958-7_80
Permalink to this page
Back