Promises and Pitfalls of High-Throughput Biological Assays

Methods Mol Biol. 2016:1415:225-43. doi: 10.1007/978-1-4939-3572-7_12.

Abstract

This chapter discusses some of the pitfalls encountered when performing biomedical research involving high-throughput "omics" data and presents some strategies and guidelines that researchers should follow when undertaking such studies. We discuss common errors in experimental design and data analysis that lead to irreproducible and non-replicable research and provide some guidelines to avoid these common mistakes so that researchers may have confidence in study outcomes, even if the results are negative. We discuss the importance of ranking and prespecifying hypotheses, performing power analysis, careful experimental design, and preplanning of statistical analyses in order to avoid the "fishing expedition" data analysis strategy, which is doomed to fail. The impact of multiple testing on false-positive rates is discussed, particularly in the context of the analysis of high-throughput data, and methods to correct for it are presented, as well as approaches to detect and correct for experimental biases and batch effects, which often plague high-throughput assays. We highlight the importance of sharing data and analysis code to facilitate reproducibility and present tools and software that are appropriate for this purpose.

Keywords: Batch effects; Confounding; Experimental design; Multiple testing; Replicability; Reproducibility; Statistical analysis plan.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, N.I.H., Extramural

MeSH terms

  • Biomedical Research
  • Guidelines as Topic
  • High-Throughput Screening Assays / methods*
  • High-Throughput Screening Assays / standards
  • Humans
  • Information Dissemination
  • Reproducibility of Results
  • Research Design