KMID : 0578320080250020279
Molecules and Cells 2008; Volume 25, No. 2, pp. 279-288
Clustering Approaches to Identifying Gene Expression Patterns from DNA Microarray Data
Do Jin-Hwan
Choi Dong-Kug
Abstract
Analysis methods are essential for making use of the large amounts of gene expression data that microarrays produce. In this review we focus on clustering techniques. The biological rationale for this approach is that many co-expressed genes are co-regulated, so identifying co-expressed genes could aid in the functional annotation of novel genes, the de novo identification of transcription factor binding sites, and the elucidation of complex biological pathways. Co-expressed genes are usually identified in microarray experiments by clustering techniques. There are many such methods, and the results obtained even for the same dataset may vary considerably depending on the algorithm and the dissimilarity measure used, as well as on user-selectable parameters such as the desired number of clusters and the initial values. Therefore, biologists who want to interpret microarray data should be aware of the weaknesses and strengths of the clustering methods used. We survey the basic principles of clustering DNA microarray data, from crisp clustering algorithms such as hierarchical clustering, K-means and self-organizing maps to more complex clustering algorithms such as fuzzy clustering.
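The dependence on the dissimilarity measure noted in the abstract can be illustrated with a minimal sketch. The three expression profiles and the helper names (`euclidean`, `pearson_distance`, `nearest`) are hypothetical and are not taken from the review. The sketch only shows that the profile judged "closest" to a gene, and therefore the gene it would be grouped with first, can change when the measure changes.

```python
import math

def euclidean(x, y):
    # Straight-line distance: sensitive to absolute expression level.
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))

def pearson_distance(x, y):
    # 1 - Pearson correlation: sensitive to the shape of the profile,
    # not to its magnitude. Ranges from 0 (same trend) to 2 (opposite trend).
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return 1.0 - cov / (sx * sy)

def nearest(target, profiles, dist):
    # Name of the profile with the smallest dissimilarity to `target`.
    others = [name for name in profiles if name != target]
    return min(others, key=lambda name: dist(profiles[target], profiles[name]))

# Hypothetical expression values across four conditions (illustration only).
profiles = {
    "geneA": [1.0, 2.0, 3.0, 4.0],
    "geneB": [10.0, 20.0, 30.0, 40.0],  # same trend as geneA, higher level
    "geneC": [4.0, 3.0, 2.0, 1.0],      # similar level to geneA, opposite trend
}

print("Euclidean nearest to geneA:", nearest("geneA", profiles, euclidean))
print("Correlation nearest to geneA:", nearest("geneA", profiles, pearson_distance))
```

Under Euclidean distance geneA is paired with geneC, whose values are of similar magnitude. Under the correlation-based measure it is paired with geneB, which follows the same trend. Any clustering algorithm built on these dissimilarities inherits that difference.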
KEYWORD
Co-Expression, DNA Microarray, Fuzzy Clustering, Hierarchical Clustering, K-means, Self-organizing Map