Online semi-supervised learning: algorithm and application in metagenomics

Authors
Publication date 2013
Host editors
  • G.Z. Li
  • S. Kim
  • M. Hughes
  • G. McLachlan
  • H. Sun
  • X. Hu
  • H. Ressom
  • B. Liu
  • M. Liebman
Book title Proceedings: 2013 IEEE International Conference on Bioinformatics and Biomedicine: 18-21 December 2013, Shanghai, China
ISBN
  • 9781479913091
Event 2013 IEEE International Conference on Bioinformatics and Biomedicine
Pages (from-to) 521-525
Publisher Piscataway, NJ: IEEE
Organisations
  • Faculty of Dentistry (ACTA)
Abstract
As the amount of metagenomic data grows rapidly, online statistical learning algorithms are poised to play key role in metagenome analysis tasks. Frequently, data are only partially labeled, namely dataset contains partial information about the problem of interest. This work presents an algorithm and a learning framework that is naturally suitable for the analysis of large scale, partially labeled metagenome datasets. We propose an online multi-output algorithm that learns by sequentially co-regularizing prediction functions on unlabeled data points and provides improved performance in comparison to several supervised methods. We evaluate predictive performance of the proposed methods on NIH Human Microbiome Project dataset. In particular we address the task of predicting relative abundance of Porphyromonas species in the oral cavity. In our empirical evaluation the proposed method outperforms several supervised regression techniques as well as leads to notable computational benefits when training the predictive model.
Document type Conference contribution
Note INSPEC Accession Number: 14079664
Language English
Published at https://doi.org/10.1109/BIBM.2013.6732550
Permalink to this page
Back