Detecting and analysing spontaneous oral cancer speech in the wild

Open Access
Authors
Publication date 2020
Journal Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Event 21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020
Volume | Issue number 21
Pages (from-to) 4826-4830
Organisations
  • Faculty of Humanities (FGw) - Amsterdam Institute for Humanities Research (AIHR)
  • Faculty of Humanities (FGw) - Amsterdam Institute for Humanities Research (AIHR) - Amsterdam Center for Language and Communication (ACLC)
Abstract Oral cancer speech is a disease which impacts more than half a million people worldwide every year. Analysis of oral cancer speech has so far focused on read speech. In this paper, we 1) present and 2) analyse a three-hour long spontaneous oral cancer speech dataset collected from YouTube. 3) We set baselines for an oral cancer speech detection task on this dataset. The analysis of these explainable machine learning baselines shows that sibilants and stop consonants are the most important indicators for spontaneous oral cancer speech detection.
Document type Article
Note Cognitive intelligence for speech processing : 21st Annual Conference of the International Speech Communication Association : INTERSPEECH 2020 : held online due to Covid-19 : Shanghai, China, 25-29 October 2020
Language English
Published at https://doi.org/10.21437/Interspeech.2020-1598
Other links https://zenodo.org/record/4268206#.X-CjXNZ7lFM
Downloads
halpern20_interspeech (Final published version)
Permalink to this page
Back