Identifying common and distinctive processes underlying multiset data

K. van Deun; A.K. Smilde; L. Thorrez; H.A.L. Kiers; I. van Mechelen

doi:https://doi.org/10.1016/j.chemolab.2013.07.005

Identifying common and distinctive processes underlying multiset data

Authors	K. van Deun A.K. Smilde L. Thorrez H.A.L. Kiers I. van Mechelen
Publication date	2013
Journal	Chemometrics and Intelligent Laboratory Systems
Volume \| Issue number	129
Pages (from-to)	40-51
Organisations	Faculty of Science (FNWI) - Swammerdam Institute for Life Sciences (SILS)
Abstract	In many research domains it has become a common practice to rely on multiple sources of data to study the same object of interest. Examples include a systems biology approach to immunology with collection of both gene expression data and immunological readouts for the same set of subjects, and the use of several high-throughput techniques for the same set of fermentation batches. A major challenge is to find the processes underlying such multiset data and to disentangle therein the common processes from those that are distinctive for a specific source. Several integrative methods have been proposed to address this challenge including canonical correlation analysis, simultaneous component analysis, OnPLS, generalized singular value decomposition, DISCO-SCA, and ECO-POWER. To get a better understanding 1) of the methods with respect to finding common and distinctive components and 2) of the relations between these methods, this paper brings the methods together and compares them both on a theoretical level and in terms of analyses of high-dimensional micro-array gene expression data obtained from subjects vaccinated against influenza.
Document type	Article
Language	English
Published at	https://doi.org/10.1016/j.chemolab.2013.07.005 (Final published version)
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

Identifying common and distinctive processes underlying multiset data