Online semi-supervised learning: algorithm and application in metagenomics
| Authors |
|
|---|---|
| Publication date | 2013 |
| Host editors |
|
| Book title | Proceedings: 2013 IEEE International Conference on Bioinformatics and Biomedicine: 18-21 December 2013, Shanghai, China |
| ISBN |
|
| Event | 2013 IEEE International Conference on Bioinformatics and Biomedicine |
| Pages (from-to) | 521-525 |
| Publisher | Piscataway, NJ: IEEE |
| Organisations |
|
| Abstract |
As the amount of metagenomic data grows rapidly, online statistical learning algorithms are poised to play key role in metagenome analysis tasks. Frequently, data are only partially labeled, namely dataset contains partial information about the problem of interest. This work presents an algorithm and a learning framework that is naturally suitable for the analysis of large scale, partially labeled metagenome datasets. We propose an online multi-output algorithm that learns by sequentially co-regularizing prediction functions on unlabeled data points and provides improved performance in comparison to several supervised methods. We evaluate predictive performance of the proposed methods on NIH Human Microbiome Project dataset. In particular we address the task of predicting relative abundance of Porphyromonas species in the oral cavity. In our empirical evaluation the proposed method outperforms several supervised regression techniques as well as leads to notable computational benefits when training the predictive model.
|
| Document type | Conference contribution |
| Note | INSPEC Accession Number: 14079664 |
| Language | English |
| Published at | https://doi.org/10.1109/BIBM.2013.6732550 |
| Permalink to this page | |
