Addressing big data issues in Scientific Data Infrastructure

Authors
Publication date 2013
Host editors
  • W.W. Smari
  • G.C. Fox
Book title Proceedings of the 2013 International Conference on Collaboration Technologies and Systems
Book subtitle May 20-24, 2013, Sheraton San Diego Hotel & Marina, San Diego, California
ISBN
  • 9781467364034
ISBN (electronic)
  • 9781467364041
  • 9781467364027
Event 2013 International Conference on Collaboration Technologies and Systems (CTS 2013)
Pages (from-to) 48-55
Publisher Piscataway, NJ: IEEE
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
Big Data are becoming a new technology focus both in science and in industry. This paper discusses the challenges that are imposed by Big Data on the modern and future Scientific Data Infrastructure (SDI). The paper discusses a nature and definition of Big Data that include such features as Volume, Velocity, Variety, Value and Veracity. The paper refers to different scientific communities to define requirements on data management, access control and security. The paper introduces the Scientific Data Lifecycle Management (SDLM) model that includes all the major stages and reflects specifics in data management in modern e-Science. The paper proposes the SDI generic architecture model that provides a basis for building interoperable data or project centric SDI using modern technologies and best practices. The paper explains how the proposed models SDLM and SDI can be naturally implemented using modern cloud based infrastructure services provisioning model and suggests the major infrastructure components for Big Data.
Document type Conference contribution
Language English
Published at https://doi.org/10.1109/CTS.2013.6567203
Permalink to this page
Back