Between-group analysis of microarray data

C. Aedín Culhane, Guy Perrière, Elizabeth C. Considine, Thomas G. Cotter, Desmond G. Higgins

Research output: Contribution to journalArticlepeer-review

Abstract

Motivation: Most supervised classification methods are limited by the requirement for more cases than variables. In microarray data the number of variables (genes) far exceeds the number of cases (arrays), and thus filtering and pre-selection of genes is required. We describe the application of Between Group Analysis (BGA) to the analysis of microarray data. A feature of BGA is that it can be used when the number of variables (genes) exceeds the number of cases (arrays). BGA is based on carrying out an ordination of groups of samples, using a standard method such as Correspondence Analysis (COA), rather than an ordination of the individual microarray samples. As such, it can be viewed as a method of carrying out COA with grouped data. Results: We illustrate the power of the method using two cancer data sets. In both cases, we can quickly and accurately classify test samples from any number of specified a priori groups and identify the genes which characterize these groups. We obtained very high rates of correct classification, as determined by jack-knife or validation experiments with training and test sets. The results are comparable to those from other methods in terms of accuracy but the power and flexibility of BGA make it an especially attractive method for the analysis of microarray cancer data.

Original languageEnglish
Pages (from-to)1600-1608
Number of pages9
JournalBioinformatics
Volume18
Issue number12
DOIs
Publication statusPublished - 1 Dec 2002
Externally publishedYes

Fingerprint

Dive into the research topics of 'Between-group analysis of microarray data'. Together they form a unique fingerprint.

Cite this