- Shigeyuki Matsui: Department of Data Science, The Institute of Statistical Mathematics, Tokyo, Japan. smatsui@ism.ac.jp
The main role of high-throughput microarrays today is screening of relevant genes from a large pool of candidate genes. For prioritizing genes for subsequent studies, gene ranking based on the strength of the association with the phenotype is a relevant statistical output. In this article, we propose sample size calculations based on gene ranking and selection using the non-parametric Mann-Whitney-Wilcoxon statistic in microarray experiments. The use of the non-parametric statistic is expected to be advantageous in robustification in gene ranking for the deviation from normality and for possible scale change by using different platforms such as polymerase chain reaction-based platforms in subsequent studies in gene expression data. Application to the data set from a clinical study for lymphoma is given.