The assessment of the developmental potential of stem cells is a crucial step towards their clinical application in regenerative medicine. It has been demonstrated that genome-wide expression profiles can predict the cellular differentiation stage by means of dimensionality reduction methods. Here we show that these techniques can be further strengthened to support decision making with i) a novel strategy for gene selection; ii) methods for combining the evidence from multiple data sets. Methods: We propose to exploit dimensionality reduction methods for the selection of genes specifically activated in different stages of differentiation. To obtain an integrated predictive model, the expression values of the selected genes from multiple data sets are combined. We investigated distinct approaches that either aggregate data sets or use learning ensembles. Results: We analyzed the performance of the proposed methods on six publicly available data sets. The selection procedure identified a reduced subset of genes whose expression values gave rise to an accurate stage prediction. The assessment of predictive accuracy demonstrated a high quality of predictions for most of the data integration methods presented. Conclusion: The experimental results highlighted the main potentials of proposed approaches. These include the ability to predict the true staging by combining multiple training data sets when this could not be inferred from a single data source, and to focus the analysis on a reduced list of genes of similar predictive performance.

Supporting Regenerative Medicine by Integrative Dimensionality Reduction.

MULAS, FRANCESCA;ZUPAN, BLAZ;BELLAZZI, RICCARDO
2012-01-01

Abstract

The assessment of the developmental potential of stem cells is a crucial step towards their clinical application in regenerative medicine. It has been demonstrated that genome-wide expression profiles can predict the cellular differentiation stage by means of dimensionality reduction methods. Here we show that these techniques can be further strengthened to support decision making with i) a novel strategy for gene selection; ii) methods for combining the evidence from multiple data sets. Methods: We propose to exploit dimensionality reduction methods for the selection of genes specifically activated in different stages of differentiation. To obtain an integrated predictive model, the expression values of the selected genes from multiple data sets are combined. We investigated distinct approaches that either aggregate data sets or use learning ensembles. Results: We analyzed the performance of the proposed methods on six publicly available data sets. The selection procedure identified a reduced subset of genes whose expression values gave rise to an accurate stage prediction. The assessment of predictive accuracy demonstrated a high quality of predictions for most of the data integration methods presented. Conclusion: The experimental results highlighted the main potentials of proposed approaches. These include the ability to predict the true staging by combining multiple training data sets when this could not be inferred from a single data source, and to focus the analysis on a reduced list of genes of similar predictive performance.
2012
Molecular Biology & Genetics considers all aspects of basic and applied genetics, including molecular genetics, prokaryotic and eukaryotic gene expression, mechanisms of mutagenesis, structure, function and regulation of genetic material. Also included are resources concerned with clinical genetics, patterns of inheritance, genetic cause, and screening and treatment of disease. Resources dealing specifically with developmentally regulated gene expression, or with signal transduction pathways that modulate gene expression at the cellular level are excluded and are covered in the Cell and Developmental Biology category.
Computer Science & Engineering includes resources on computer hardware and architecture, computer software, software engineering and design, computer graphics, programming languages, theoretical computing, computing methodologies, broad computing topics, and interdisciplinary computer applications.
Sì, ma tipo non specificato
Inglese
Internazionale
ELETTRONICO
51
4
Regenerative medicine; Data mining; Bioinformatics
4
info:eu-repo/semantics/article
262
Mulas, Francesca; Zagar, L; Zupan, Blaz; Bellazzi, Riccardo
1 Contributo su Rivista::1.1 Articolo in rivista
none
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11571/461866
Citazioni
  • ???jsp.display-item.citation.pmc??? 3
  • Scopus 4
  • ???jsp.display-item.citation.isi??? 4
social impact