KMID : 0381120150370060525
|
|
Genes and Genomics 2015 Volume.37 No. 6 p.525 ~ p.535
|
|
Effects of omics data combinations on in silico tumor-normal tissue classification
|
|
Seok Ho-Sik
Seok Seung-Hwan Kim Jae-Bum
|
|
Abstract
|
|
|
A disease can be characterized by various attributes such as genomic, epigenetic, and transcriptomic features beyond physiological symptoms. The accumulation of vast datasets allows us to investigate the relative effectiveness of each omics data and their combinations for in silico analysis of diseases. Here, we employed a classification method with the well-established measure of information gain for the computational analysis of the effect of the aggregation of omics data, especially for the task of in silico classification of tumor-normal samples for bladder urothelial carcinoma and kidney renal papillary cell carcinoma. We observed that the combination of multi-omics data such as copy number variation, DNA methylation, RNA-Seq, and somatic mutations have beneficial effects. The quantitative analysis using information gain and various measures for classification-performance showed that the combination of multiple omics data improved the performance in general. The qualitative analysis referring previous researches also confirmed the relevance of genes with higher information gain to target diseases. Our results report that the combination of multiple omics data is beneficial and the information gain which focuses on the distribution of attributes across target domains could be useful as an indicator of the effect of each omics data on tumor-normal sample classification.
|
|
KEYWORD
|
|
Multiomics data, Copy number variation, DNA methylation, RNA-Seq, Somatic mutations, The cancer genome atlas (TCGA)
|
|
FullTexts / Linksout information
|
|
|
|
Listed journal information
|
|
|