Calibrating aquatic microfossil proxies with regression-tree ensembles: Cross-validation with modern chironomid and diatom data

Authors
  • J.S. Salonen
  • A.J. Verster
  • S. Engels
  • J. Soininen
  • M. Trachsel
  • M. Luoto
Publication date 07-2016
Journal Holocene
Volume | Issue number 26 | 7
Pages (from-to) 1040-1048
Organisations
  • Faculty of Science (FNWI)
  • Faculty of Science (FNWI) - Institute for Biodiversity and Ecosystem Dynamics (IBED)
Abstract
We examine the ability of four different regression-tree ensemble techniques (bagging, random forest, rotation forest and boosted tree) in calibration of aquatic microfossil proxies. The methods are tested with six chironomid and diatom datasets, using a variety of cross-validation schemes. We find random forest, rotation forest and the boosted tree to have a similar performance, while bagging performs less well and in several cases has trouble producing continuous predictions. In comparison with commonly used parametric transfer-function approaches (PLS, WA, WA-PLS), we find that in some cases tree-ensemble methods outperform the best-performing transfer-function technique, especially with large datasets characterized by complex taxon responses and abundant noise. However, parametric transfer functions remain competitive with datasets characterized by low number of samples or linear taxon responses. We present an implementation of the rotation forest algorithm in R.
Document type Article
Language English
Published at https://doi.org/10.1177/0959683616632881
Permalink to this page
Back