Contemporary Challenges for Data-intensive Scientific Workflow Management Systems

Authors
Publication date 2015
Book title Proceedings of WORKS 2015
Book subtitle the 10th Workshop on Workflows in Support of Large-Scale Science : November 15, 2015
ISBN (electronic)
  • 9781450339896
Event International workshop on Workflows in support of large-scale science (WORKS 15), in the context of IEEE Supercomputing 2015
Article number 4
Number of pages 11
Publisher New York, NY: The Association for Computing Machinery
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
Data-intensive sciences now represent the forefront of current scientific computing. To handle this 'Big Data' focus, scientists demand enabling technologies that can adapt to the increasingly distributed, collaborative, and exploratory scientific milieu. However, how these challenges have changed the design requirements of scientific workflow management systems (SWMSs) has not been assessed. First, how scientists currently use SWMSs was determined through a comprehensive usage survey examining 1455 research publications from 2013 to July 31st, 2015. To understand how data-intensive scientists are producing impactful research, we further examined usage of two major research clouds, the Open Science Data Cloud (OSDC) and Cornell's Red Cloud. Here, we present a road map for SWMS development for data-intensive sciences. SWMSs are now needed that interconnect diverse software packages while enabling data exploration and multi-user interaction across distributed software and hardware environments.
Document type Conference contribution
Language English
Published at https://doi.org/10.1145/2822332.2822336
Permalink to this page
Back