Personalized breast cancer onset prediction from lifestyle and health history information

PLoS One. 2022 Dec 19;17(12):e0279174. doi: 10.1371/journal.pone.0279174. eCollection 2022.

Abstract

We propose a method to predict when a woman will develop breast cancer (BCa) from her lifestyle and health history features. To address this objective, we use data from the Alberta's Tomorrow Project of 18,288 women to train Individual Survival Distribution (ISD) models to predict an individual's Breast-Cancer-Onset (BCaO) probability curve. We show that our three-step approach-(1) filling missing data with multiple imputations by chained equations, followed by (2) feature selection with the multivariate Cox method, and finally, (3) using MTLR to learn an ISD model-produced the model with the smallest L1-Hinge loss among all calibrated models with comparable C-index. We also identified 7 actionable lifestyle features that a woman can modify and illustrate how this model can predict the quantitative effects of those changes-suggesting how much each will potentially extend her BCa-free time. We anticipate this approach could be used to identify appropriate interventions for individuals with a higher likelihood of developing BCa in their lifetime.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Breast Neoplasms*
  • Female
  • Humans
  • Life Style
  • Probability
  • Surveys and Questionnaires

Grants and funding

Alberta Health, Alberta, Canada, Grace Shen-Tu Canadian Breast Cancer Foundation, Prairies/NWT Chapter, Canada, Sambasivarao Damaraju Alberta Cancer Foundation, Alberta, Canada, Grace Shen-Tu Canadian Partnership Against Cancer and Health Canada, Ontario, Canada, Grace Shen-Tu Alberta Health Services, Alberta, Canada, Grace Shen-Tu Alberta Machine Intelligence Institute, Russell Greiner Natural Sciences and Engineering Research Council of Canada, Russell Greiner