Responsible data management

Open Access
Authors
  • J. Stoyanovich
  • S. Abiteboul
  • B. Howe
  • H.V. Jagadish
Publication date 06-2022
Journal Communications of the ACM
Volume | Issue number 65 | 6
Pages (from-to) 64-74
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
Responsible data management involves incorporating ethical and legal considerations across the life cycle of data collection, analysis, and use in all data-intensive systems, whether they involve machine learning and AI or not. Decisions during data collection and preparation profoundly impact the robustness, fairness, and interpretability of data-intensive systems. Experts must consider these earlier life cycle stages to improve data quality, control for bias, and allow humans to oversee the operation of these systems. Data alone is insufficient to distinguish between a distorted reflection of a perfect world, a perfect reflection of a distorted world, or a combination of both. The assumed or externally verified nature of the distortions must be explicitly stated to allow experts to decide whether and how to mitigate their effects.
Document type Article
Language English
Published at https://doi.org/10.1145/3488717
Other links https://www.scopus.com/pages/publications/85131138349
Downloads
Responsible data management (Final published version)
Permalink to this page
Back