On the probability of correct selection for large k populations, with
application to Microarray Data
Xinping Cui
(UC Riverside)
One frontier of modern statistical research is the "multiple testing
problem" arising from data sets with large k (>1000) populations, e.g.
differential expression tests on thousands of genes through microarrays.
In this talk, we propose "d-best" and "G-best" estimators on the
probability of correctly selecting one or more "best" genes out of k
genes under testing and demonstrate how it can be employed as a
"meta-method" to discriminate between competing multiple hypothesis
testing approaches (eg, maxT, FDR, etc) in identifying differentially
expressed genes. We will also explore the use of this new estimator in
gene selection in cancer classification.