Challenges and approaches to statistical design and inference in high dimensional investigations

TitleChallenges and approaches to statistical design and inference in high dimensional investigations
Publication TypeBook Chapter
Year of Publication2009
AuthorsGadbury, GL, Garrett, KA, Allison, DB
EditorBelostotsky, DA
Book TitlePlant Systems Biology, Methods in Molecular Biology Series
Pagination181 -206
PublisherThe Humana Press Inc
CityTotowa, NJ
Accession NumberKNZ001256
KeywordsFDR, genomics, high-dimensional, microarray, multiple testing, statistics

Advances in modern technologies have facilitated high-dimensional experiments (HDEs) that generate tremendous amounts of genomic, proteomic, and other “omic” data. HDEs involving whole-genome sequences and polymorphisms, expression levels of genes, protein abundance measurements, and combinations thereof have become a vanguard for new analytic approaches to the analysis of HDE data. Such situations demand creative approaches to the processes of statistical inference, estimation, prediction, classification, and study design. The novel and challenging biological questions asked from HDE data have resulted in many specialized analytic techniques being developed. This chapter discusses some of the unique statistical challenges facing investigators studying high-dimensional biology and describes some approaches being developed by statistical scientists. We have included some focus on the increasing interest in questions involving testing multiple propositions simultaneously, appropriate inferential indicators for the types of questions biologists are interested in, and the need for replication of results across independent studies, investigators, and settings. A key consideration inherent throughout is the challenge in providing methods that a statistician judges to be sound and a biologist finds informative.