Human Molecular Genetics, 2003, Vol. 12, No. 8 823-836
DOI: 10.1093/hmg/ddg093
© 2003 Oxford University Press
Pattern recognition in gene expression profiling using DNA array: a comparative study of different statistical methods applied to cancer classification
CRIBI Biotechnology Centre and Dipartimento di Biologia, Università degli Studi di Padova, Via Ugo Bassi 58/B, 35121 Padova, Italy
Received November 11, 2002; Accepted February 7, 2003
Large-scale parallel measurements of the expression of many thousands genes are now available with high-density array made with collections of cDNA fragments, or oligonucleotide corresponding to different transcripts. These technologies have been applied to cancer investigations since the availability of such a large number of markers makes DNA array a powerful diagnostic tool for tumour and patient classification. Over the last two years, a series of computational tools have been developed for the analysis of different aspects of gene profiling. Our work tries to compare a series of supervised statistical techniques on the basis of their ability to correctly classify different types of tumours. A simulation approach was initially used to control the huge source of variation among and between patients, and to evaluate the ability of algorithms to classify tumours in relation to different types of experimental variables. Different techniques for reduction of data dimension were then added to the discriminant analysis and compared according to their ability to capture the main genetic information. The simulation results have been tested by applying the selected classification algorithms to two experimental microarray datasets of human cancers, and by measuring the correspondent rates of misclassification. Our analyses identify in these datasets a series of genes principally involved in tumour characterization. The functional role of these discriminant transcripts is discussed.
* To whom correspondence should be addressed. Tel: +39 0498276221; Fax: +39 0498276259; Email: lanfra{at}cribi.unipd.it
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
I. Medina, D. Montaner, J. Tarraga, and J. Dopazo Prophet, a web-based tool for class prediction using microarray data Bioinformatics, February 1, 2007; 23(3): 390 - 391. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. F. A. Wessels, M. J. T. Reinders, A. A. M. Hart, C. J. Veenman, H. Dai, Y. D. He, and L. J. v. Veer A protocol for building and evaluating predictors of disease state based on microarray data Bioinformatics, October 1, 2005; 21(19): 3755 - 3762. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Liang, B. Tayo, X. Cai, and A. Kelemen Differential and trajectory methods for time course gene expression data Bioinformatics, July 1, 2005; 21(13): 3009 - 3016. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Statnikov, C. F. Aliferis, I. Tsamardinos, D. Hardin, and S. Levy A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis Bioinformatics, March 1, 2005; 21(5): 631 - 643. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y.-d. Chen, S. Zheng, J.-k. Yu, and X. Hu Artificial Neural Networks Analysis of Surface-Enhanced Laser Desorption/Ionization Mass Spectra of Serum Protein Pattern Distinguishes Colorectal Cancer from Healthy Population Clin. Cancer Res., December 15, 2004; 10(24): 8380 - 8385. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Herrero, J. M. Vaquerizas, F. Al-Shahrour, L. Conde, A. Mateos, J. S. R. Diaz-Uriarte, and J. Dopazo New challenges in gene expression data analysis and the extended GEPAS Nucleic Acids Res., July 1, 2004; 32(suppl_2): W485 - W491. [Abstract] [Full Text] [PDF] |
||||


