Questionnaire data analysis using information geometry

Open Access
Authors
Publication date 25-05-2020
Journal Scientific Reports
Article number 8633
Volume | Issue number 10
Number of pages 9
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
  • Interfacultary Research - Institute for Advanced Study (IAS)
Abstract

The analysis of questionnaires often involves representing the high-dimensional responses in a low-dimensional space (e.g., PCA, MCA, or t-SNE). However questionnaire data often contains categorical variables and common statistical model assumptions rarely hold. Here we present a non-parametric approach based on Fisher Information which obtains a low-dimensional embedding of a statistical manifold (SM). The SM has deep connections with parametric statistical models and the theory of phase transitions in statistical physics. Firstly we simulate questionnaire responses based on a non-linear SM and validate our method compared to other methods. Secondly we apply our method to two empirical datasets containing largely categorical variables: an anthropological survey of rice farmers in Bali and a cohort study on health inequality in Amsterdam. Compare to previous analysis and known anthropological knowledge we conclude that our method best discriminates between different behaviours, paving the way to dimension reduction as effective as for continuous data.

Document type Article
Note With supplementary information
Language English
Published at https://doi.org/10.1038/s41598-020-63760-8
Other links https://www.scopus.com/pages/publications/85085390262
Downloads
s41598-020-63760-8 (Final published version)
Supplementary materials
Permalink to this page
Back