Artificially intelligent opinion polling

Open Access
Authors
Publication date 01-03-2026
Journal Royal Society Open Science
Article number 251150
Volume | Issue number 13 | 3
Number of pages 58
Organisations
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
  • Faculty of Humanities (FGw) - Amsterdam Institute for Humanities Research (AIHR)
Abstract
We present a method for using large language models (LLMs) to conduct public opinion polling with social media data. We sample accounts in the run-up to the 2020 US Presidential election based on recent posts related to the major-party candidates. An LLM extracts structured, survey-like data—including socio-demographic characteristics and voting behaviour—from these digital traces. We show that LLM labelling tends to agree with human raters with respect to characteristics of online users. The labelled social media accounts are mapped to a stratification frame, which facilitates MrP estimation of presidential vote share at the state level. We improve this model-based pre-election polling for severely biased online samples by introducing a bias-correction term. The end-to-end implementation takes unrepresentative, unstructured social media data as inputs and produces timely high-quality area-level estimates as outputs. This is artificially intelligent opinion polling. We show that our artificial intelligence (AI) polling estimates of the 2020 election are highly accurate, on par with estimates produced by state-level polling aggregators such as FiveThirtyEight, or from models fit to extremely expensive high-quality samples.
Document type Article
Language English
Published at https://doi.org/10.1098/rsos.251150
Downloads
rsos.251150 (Final published version)
Permalink to this page
Back