Biomedical informatics and machine learning for clinical genomics

James A Diao; Isaac S Kohane; Arjun K Manrai

doi:10.1093/hmg/ddy088

Biomedical informatics and machine learning for clinical genomics

Hum Mol Genet. 2018 May 1;27(R1):R29-R34. doi: 10.1093/hmg/ddy088.

Authors

James A Diao^{1

2}, Isaac S Kohane¹, Arjun K Manrai¹

Affiliations

¹ Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA.
² Department of Statistics and Data Science, Yale University, New Haven, CT 06520, USA.

Abstract

While tens of thousands of pathogenic variants are used to inform the many clinical applications of genomics, there remains limited information on quantitative disease risk for the majority of variants used in clinical practice. At the same time, rising demand for genetic counselling has prompted a growing need for computational approaches that can help interpret genetic variation. Such tasks include predicting variant pathogenicity and identifying variants that are too common to be penetrant. To address these challenges, researchers are increasingly turning to integrative informatics approaches. These approaches often leverage vast sources of data, including electronic health records and population-level allele frequency databases (e.g. gnomAD), as well as machine learning techniques such as support vector machines and deep learning. In this review, we highlight recent informatics and machine learning approaches that are improving our understanding of pathogenic variation and discuss obstacles that may limit their emerging role in clinical genomics.

Publication types

Research Support, N.I.H., Extramural
Review

MeSH terms

Computational Biology / trends*
Databases, Genetic
Genome, Human / genetics*
Genomics / trends*
Humans
Machine Learning / trends*

Abstract

Publication types

MeSH terms

Grants and funding