Language that Captivates the Audience: Predicting Affective Ratings of TED Talks in a Multi-Label Classification Task

Open Access
Authors
Publication date 2021
Host editors
  • O. De Clerq
  • A. Balahur
  • J. Sedoc
  • V. Barriere
  • S. Tafreshi
  • S. Buechel
  • V. Hoste
Book title The Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis
Book subtitle Proceedings of the Eleventh Workshop : EACL 2021 : April 19, 2021, held online due to the COVID situation
ISBN (electronic)
  • 9781954085183
Event 11th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis
Pages (from-to) 13–24
Publisher Stroudsburg, PA: The Association for Computational Linguistics
Organisations
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract
The aim of the paper is twofold: (1) to automatically predict the ratings assigned by viewers to 14 categories available for TED talks in a multi-label classification task and (2) to determine what types of features drive classification accuracy for each of the categories. The focus is on features of language usage from five groups pertaining to syntactic complexity, lexical richness, register-based n-gram measures, information-theoretic measures and LIWC-style measures. We show that a Recurrent Neural Network classifier trained exclusively on within-text distributions of such features can reach relatively high levels of overall accuracy (69%) across the 14 categories. We find that features from two groups are strong predictors of the affective ratings across all categories and that there are distinct patterns of language usage for each rating category.
Document type Conference contribution
Note With supplementary material
Language English
Published at https://aclanthology.org/2021.wassa-1.2
Downloads
2021.wassa-1.2 (Final published version)
Supplementary materials
Permalink to this page
Back