ACM SIGIR 2008: Thirty-first Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 20-24 July 2008, Singapore: Proceedings
ISBN
9781605581644
Event
31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2008), Singapore
Pages (from-to)
773-774
Publisher
New York, NY: Association for Computing Machinery (ACM)
Organisations
Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
User generated spoken audio remains a challenge for Automatic Speech Recognition (ASR) technology and content-based audio surrogates derived from ASR-transcripts must be error robust. An investigation of the use of term clouds as surrogates for podcasts demonstrates that ASR term clouds closely approximate term clouds derived from human-generated transcripts across a range of cloud sizes. A user study confirms the conclusion that ASR-clouds are viable surrogates for depicting the content of podcasts.