Biomedical informatics and machine learning for clinical genomics

Hum Mol Genet. 2018 May 1;27(R1):R29-R34. doi: 10.1093/hmg/ddy088.

Abstract

While tens of thousands of pathogenic variants are used to inform the many clinical applications of genomics, there remains limited information on quantitative disease risk for the majority of variants used in clinical practice. At the same time, rising demand for genetic counselling has prompted a growing need for computational approaches that can help interpret genetic variation. Such tasks include predicting variant pathogenicity and identifying variants that are too common to be penetrant. To address these challenges, researchers are increasingly turning to integrative informatics approaches. These approaches often leverage vast sources of data, including electronic health records and population-level allele frequency databases (e.g. gnomAD), as well as machine learning techniques such as support vector machines and deep learning. In this review, we highlight recent informatics and machine learning approaches that are improving our understanding of pathogenic variation and discuss obstacles that may limit their emerging role in clinical genomics.

Publication types

  • Research Support, N.I.H., Extramural
  • Review

MeSH terms

  • Computational Biology / trends*
  • Databases, Genetic
  • Genome, Human / genetics*
  • Genomics / trends*
  • Humans
  • Machine Learning / trends*