Efficient Search Algorithms for Identifying Synergistic Associations in High-Dimensional Datasets

Open Access
Authors
  • B.H. Mishra
  • M. Kähönen
  • O.T. Raitakari
  • R. Laaksonen
  • L. Keltikangas-Järvinen
  • M. Juonala
  • R. Quax ORCID logo
Publication date 11-2024
Journal Entropy
Article number 968
Volume | Issue number 26 | 11
Number of pages 35
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
  • Faculty of Social and Behavioural Sciences (FMG) - Psychology Research Institute (PsyRes)
Abstract

In recent years, there has been a notably increased interest in the study of multivariate interactions and emergent higher-order dependencies. This is particularly evident in the context of identifying synergistic sets, which are defined as combinations of elements whose joint interactions result in the emergence of information that is not present in any individual subset of those elements. The scalability of frameworks such as partial information decomposition (PID) and those based on multivariate extensions of mutual information, such as O-information, is limited by combinational explosion in the number of sets that must be assessed. In order to address these challenges, we propose a novel approach that utilises stochastic search strategies in order to identify synergistic triplets within datasets. Furthermore, the methodology is extensible to larger sets and various synergy measures. By employing stochastic search, our approach circumvents the constraints of exhaustive enumeration, offering a scalable and efficient means to uncover intricate dependencies. The flexibility of our method is illustrated through its application to two epidemiological datasets: The Young Finns Study and the UK Biobank Nuclear Magnetic Resonance (NMR) data. Additionally, we present a heuristic for reducing the number of synergistic sets to analyse in large datasets by excluding sets with overlapping information. We also illustrate the risks of performing a feature selection before assessing synergistic information in the system.

Document type Article
Note In special issue: Topological Data Analysis Meets Information Theory. New Perspectives for the Analysis of Higher-Order Interactions in Complex Systems
Language English
Published at https://doi.org/10.3390/e26110968
Other links https://www.scopus.com/pages/publications/85210424265
Downloads
entropy-26-00968-v2 (Final published version)
Permalink to this page
Back