Automated Classification of Consumer Health Information Needs in Patient Portal Messages

AMIA Annu Symp Proc. 2015 Nov 5:2015:1861-70. eCollection 2015.

Abstract

Patients have diverse health information needs, and secure messaging through patient portals is an emerging means by which such needs are expressed and met. As patient portal adoption increases, growing volumes of secure messages may burden healthcare providers. Automated classification could expedite portal message triage and answering. We created four automated classifiers based on word content and natural language processing techniques to identify health information needs in 1000 patient-generated portal messages. Logistic regression and random forest classifiers detected single information needs well, with area under the curves of 0.804-0.914. A logistic regression classifier accurately found the set of needs within a message, with a Jaccard index of 0.859 (95% Confidence Interval: (0.847, 0.871)). Automated classification of consumer health information needs expressed in patient portal messages is feasible and may allow direct linking to relevant resources or creation of institutional resources for commonly expressed needs.

MeSH terms

  • Consumer Health Information*
  • Electronic Health Records*
  • Health Resources
  • Health Services Needs and Demand / classification*
  • Humans
  • Natural Language Processing*
  • Patient Portals*