Identifying dementia cases with routinely collected health data: A systematic review

Tim Wilkinson; Amanda Ly; Christian Schnier; Kristiina Rannikmäe; Kathryn Bush; Carol Brayne; Terence J Quinn; Cathie L M Sudlow; UK Biobank Neurodegenerative Outcomes Group and Dementias Platform UK

doi:10.1016/j.jalz.2018.02.016

Alzheimers Dement. 2018 Aug;14(8):1038-1051. doi: 10.1016/j.jalz.2018.02.016. Epub 2018 Apr 3.

Authors

Tim Wilkinson¹, Amanda Ly², Christian Schnier², Kristiina Rannikmäe³, Kathryn Bush³, Carol Brayne⁴, Terence J Quinn⁵, Cathie L M Sudlow⁶; UK Biobank Neurodegenerative Outcomes Group and Dementias Platform UK

Affiliations

¹ Centre for Clinical Brain Sciences, University of Edinburgh, Edinburgh, Scotland; Usher Institute of Population Health Sciences and Informatics, Nine Bioquarter, Edinburgh, Scotland. Electronic address: tim.wilkinson@ed.ac.uk.
² Usher Institute of Population Health Sciences and Informatics, Nine Bioquarter, Edinburgh, Scotland.
³ Centre for Clinical Brain Sciences, University of Edinburgh, Edinburgh, Scotland; Usher Institute of Population Health Sciences and Informatics, Nine Bioquarter, Edinburgh, Scotland.
⁴ Institute of Public Health, Cambridge University, Cambridge, UK.
⁵ Institute of Cardiovascular and Medical Sciences, University of Glasgow, Glasgow, Scotland.
⁶ Centre for Clinical Brain Sciences, University of Edinburgh, Edinburgh, Scotland; Usher Institute of Population Health Sciences and Informatics, Nine Bioquarter, Edinburgh, Scotland; UK Biobank, Coordinating Centre, Stockport, UK.

Abstract

Introduction: Prospective, population-based studies can be rich resources for dementia research. Follow-up in many such studies is through linkage to routinely collected, coded health-care data sets. We evaluated the accuracy of these data sets for dementia case identification.

Methods: We systematically reviewed the literature for studies comparing dementia coding in routinely collected data sets to any expert-led reference standard. We recorded study characteristics and two accuracy measures: positive predictive value (PPV) and sensitivity.
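As an illustration of the two accuracy measures named in the Methods, the following minimal sketch computes PPV and sensitivity from counts of coded cases checked against a reference standard. The function names and the example counts are hypothetical and are not taken from any study in the review.

```python
def ppv(true_pos: int, false_pos: int) -> float:
    """Positive predictive value: the share of coded dementia cases
    that the reference standard confirms as dementia."""
    flagged = true_pos + false_pos
    if flagged == 0:
        raise ValueError("no coded cases, so PPV is undefined")
    return true_pos / flagged


def sensitivity(true_pos: int, false_neg: int) -> float:
    """Sensitivity: the share of reference-standard dementia cases
    that carry a dementia code in the routine data."""
    actual = true_pos + false_neg
    if actual == 0:
        raise ValueError("no reference-standard cases, so sensitivity is undefined")
    return true_pos / actual


# Hypothetical counts, for illustration only (not data from the review):
# 80 coded cases confirmed, 20 coded cases not confirmed,
# 40 confirmed cases with no dementia code in the routine data.
example_ppv = ppv(true_pos=80, false_pos=20)
example_sensitivity = sensitivity(true_pos=80, false_neg=40)
print(f"PPV = {example_ppv:.0%}, sensitivity = {example_sensitivity:.0%}")
```

With these made-up counts, a data set could have a fairly high PPV while still missing a third of true cases, which is why the two measures are reported separately.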

Results: We identified 27 eligible studies, with 25 estimating PPV and eight estimating sensitivity. Study settings and methods varied widely. For all-cause dementia, PPVs ranged from 33% to 100%, but 16/27 were >75%. Sensitivities ranged from 21% to 86%. PPVs for Alzheimer's disease (range 57%-100%) were generally higher than those for vascular dementia (range 19%-91%).

Discussion: Linkage to routine health-care data can achieve a high PPV and reasonable sensitivity in certain settings. Given the heterogeneity in accuracy estimates, cohorts should ideally conduct their own setting-specific validation.

Keywords: Alzheimer's disease; Clinical coding; Cohort studies; Dementia; Epidemiology; Positive predictive value; Predictive value of tests; Prospective studies; Sensitivity; Vascular.

Publication types

Research Support, Non-U.S. Gov't
Systematic Review

MeSH terms

Alzheimer Disease / diagnosis*
Alzheimer Disease / epidemiology
Clinical Coding / standards
Data Collection / standards*
Delivery of Health Care*
Dementia, Vascular / diagnosis
Dementia, Vascular / epidemiology
Humans
Sensitivity and Specificity

Grants and funding

MR/P001823/1/MRC_/Medical Research Council/United Kingdom