Data Integration Landscapes: The Case for Non-optimal Solutions in Network Diffusion Models

Open Access
Authors
Publication date 2023
Host editors
  • J. Mikyška
  • C. de Mulatier
  • M. Paszynski
  • V.V. Krzhizhanovskaya
  • J.J. Dongarra
  • P.M.A. Sloot
Book title Computational Science – ICCS 2023
Book subtitle 23rd International Conference, Prague, Czech Republic, July 3–5, 2023 : proceedings
ISBN
  • 9783031359941
ISBN (electronic)
  • 9783031359958
Series Lecture Notes in Computer Science
Event 23rd International Conference on Computational Science, ICCS 2023
Volume | Issue number I
Pages (from-to) 494-508
Number of pages 15
Publisher Cham: Springer
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract

The successful application of computational models presupposes access to accurate, relevant, and representative datasets. The growth of public data, and the increasing practice of data sharing and reuse, emphasises the importance of data provenance and increases the need for modellers to understand how data processing decisions might impact model output. One key step in the data processing pipeline is that of data integration and entity resolution, where entities are matched across disparate datasets. In this paper, we present a new formulation of data integration in complex networks that incorporates integration uncertainty. We define an approach for understanding how different data integration setups can impact the results of network diffusion models under this uncertainty, allowing one to systematically characterise potential model outputs in order to create an output distribution that provides a more comprehensive picture.
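The idea summarised in the abstract can be illustrated with a small sketch. It is not the authors' formulation or code. It assumes that each candidate entity match carries an independent match probability, that accepting a match merges the two nodes into one entity, and that diffusion is a simple deterministic spread in which every informed node informs all its neighbours at each step. All function names, node labels, and probabilities below are hypothetical. The sketch enumerates every integration setup and weights the diffusion output by the probability of that setup, giving an output distribution instead of a single value.

```python
from collections import defaultdict
from itertools import product


def merge_nodes(edges, matches):
    """Contract each accepted match into a single node (union-find)."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for a, b in matches:
        parent[find(a)] = find(b)

    # Rebuild an undirected adjacency map over the merged entities.
    adj = defaultdict(set)
    for u, v in edges:
        ru, rv = find(u), find(v)
        if ru != rv:
            adj[ru].add(rv)
            adj[rv].add(ru)
    return adj, find


def spread_size(adj, seed, steps):
    """Assumed toy diffusion: informed nodes inform all neighbours each step."""
    informed = {seed}
    frontier = {seed}
    for _ in range(steps):
        frontier = {n for f in frontier for n in adj[f]} - informed
        informed |= frontier
    return len(informed)


def output_distribution(edges, candidates, seed, steps):
    """Enumerate all integration setups; weight each output by its probability."""
    dist = defaultdict(float)
    for choice in product([False, True], repeat=len(candidates)):
        prob = 1.0
        accepted = []
        for (pair, p), keep in zip(candidates, choice):
            prob *= p if keep else 1.0 - p
            if keep:
                accepted.append(pair)
        adj, find = merge_nodes(edges, accepted)
        dist[spread_size(adj, find(seed), steps)] += prob
    return dict(dist)


# Hypothetical data: two small datasets and two uncertain entity matches.
edges = [("a1", "a2"), ("a2", "a3"), ("b1", "b2")]
candidates = [(("a3", "b1"), 0.5), (("a2", "b2"), 0.25)]

dist = output_distribution(edges, candidates, seed="a1", steps=2)
print(dist)
```

Here the keys of `dist` are the number of merged entities reached after two steps and the values are the total probability of the integration setups that produce that count. Exhaustive enumeration grows as 2^m in the number of candidate matches, so a larger problem would need sampling over setups instead.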

Document type Conference contribution
Language English
Published at https://doi.org/10.1007/978-3-031-35995-8_35
Other links https://www.scopus.com/pages/publications/85164940830
Downloads
978-3-031-35995-8_35 (Final published version)