Transformers and large language models in healthcare: A review

Artif Intell Med. 2024 Aug;154:102900. doi: 10.1016/j.artmed.2024.102900. Epub 2024 Jun 5.

Abstract

With Artificial Intelligence (AI) increasingly permeating various aspects of society, including healthcare, the adoption of the Transformer neural network architecture is rapidly changing many applications. The Transformer is a deep learning architecture initially developed to solve general-purpose Natural Language Processing (NLP) tasks and has subsequently been adapted in many fields, including healthcare. In this survey paper, we provide an overview of how this architecture has been adopted to analyze various forms of healthcare data, including clinical NLP, medical imaging, structured Electronic Health Records (EHR), social media, bio-physiological signals, and biomolecular sequences. Furthermore, under the umbrella of critical care, we also include articles that used the transformer architecture to generate surgical instructions and predict adverse outcomes after surgery. Across diverse settings, these models have been used for clinical diagnosis, report generation, data reconstruction, and drug/protein synthesis. Finally, we discuss the benefits and limitations of using transformers in healthcare and examine issues such as computational cost, model interpretability, fairness, alignment with human values, ethical implications, and environmental impact.
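To make the architecture the review surveys concrete, below is a minimal sketch of scaled dot-product self-attention, the core operation of the Transformer. The NumPy implementation, array shapes, and the clinical-note framing in the comments are illustrative assumptions, not code from the paper; real models add multiple heads, learned projections, and masking.

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Core Transformer operation: softmax(Q K^T / sqrt(d_k)) V.

    Q, K: arrays of shape (seq_len, d_k); V: (seq_len, d_v).
    """
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                   # pairwise token similarities
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)    # row-wise softmax
    return weights @ V                                # weighted mixture of value vectors

# Toy example: 4 "tokens" (e.g., clinical-note word embeddings) of dimension 8.
rng = np.random.default_rng(0)
x = rng.normal(size=(4, 8))
out = scaled_dot_product_attention(x, x, x)           # self-attention: Q = K = V = x
print(out.shape)                                      # (4, 8)
```

Because the same operation applies to any sequence of vectors, the mechanism transfers directly from text tokens to the other modalities the review covers, such as image patches, EHR event sequences, and biomolecular residues.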

Keywords: Electronic Health Records; Healthcare; Large Language Models; Medical Imaging; Natural Language Processing; Transformers.

Publication types

  • Review
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Artificial Intelligence
  • Deep Learning*
  • Delivery of Health Care / organization & administration
  • Electronic Health Records
  • Humans
  • Natural Language Processing*
  • Neural Networks, Computer