A semi-automated workflow for biodiversity data retrieval, cleaning, and quality control

Open Access
Authors
  • C. Mathew
  • A. Güntsch
  • M. Obst
  • S. Vicario
  • R. Haines
  • A.R. Williams
  • Y. de Jong
  • C. Goble
Publication date 2014
Journal Biodiversity Data Journal
Article number e4221
Volume 2
Number of pages 12
Organisations
  • Faculty of Science (FNWI) - Institute for Biodiversity and Ecosystem Dynamics (IBED)
Abstract
The compilation and cleaning of data needed for analyses and prediction of species distributions is a time-consuming process that requires a solid understanding of the data formats and service APIs provided by biodiversity informatics infrastructures. We designed and implemented a Taverna-based Data Refinement Workflow that integrates taxonomic data retrieval, data cleaning, and data selection into a consistent, standards-based, and effective system, hiding the complexity of the underlying service infrastructures. The workflow can be used freely both locally and through a web portal that requires no additional software installation by users.
Document type Article
Language English
Published at https://doi.org/10.3897/BDJ.2.e4221
Downloads
BDJ_article_4221 (Final published version)