Reproducible evaluation of classification methods in Alzheimer's disease: Framework and application to MRI and PET data

Jorge Samper-González; Ninon Burgos; Simona Bottani; Sabrina Fontanella; Pascal Lu; Arnaud Marcoux; Alexandre Routier; Jérémy Guillon; Michael Bacci; Junhao Wen; Anne Bertrand; Hugo Bertin; Marie-Odile Habert; Stanley Durrleman; Theodoros Evgeniou; Olivier Colliot; Alzheimer's Disease Neuroimaging Initiative; Australian Imaging Biomarkers and Lifestyle flagship study of ageing

doi:10.1016/j.neuroimage.2018.08.042

Reproducible evaluation of classification methods in Alzheimer's disease: Framework and application to MRI and PET data

Neuroimage. 2018 Dec:183:504-521. doi: 10.1016/j.neuroimage.2018.08.042. Epub 2018 Aug 18.

Authors

Jorge Samper-González¹, Ninon Burgos², Simona Bottani³, Sabrina Fontanella³, Pascal Lu³, Arnaud Marcoux³, Alexandre Routier³, Jérémy Guillon³, Michael Bacci², Junhao Wen², Anne Bertrand⁴, Hugo Bertin⁵, Marie-Odile Habert⁶, Stanley Durrleman², Theodoros Evgeniou⁷, Olivier Colliot⁸; Alzheimer's Disease Neuroimaging Initiative; Australian Imaging Biomarkers and Lifestyle flagship study of ageing

Affiliations

¹ Inria, ARAMIS Project-team, F-75013, Paris, France; Institut du Cerveau et de la Moelle épinière, F-75013, Paris, France; Inserm, U1127, F-75013, Paris, France; CNRS, UMR 7225, F-75013, Paris, France; Sorbonne Université, F-75013, Paris, France. Electronic address: jorge.samper-gonzalez@inria.fr.
² Inria, ARAMIS Project-team, F-75013, Paris, France; Institut du Cerveau et de la Moelle épinière, F-75013, Paris, France; Inserm, U1127, F-75013, Paris, France; CNRS, UMR 7225, F-75013, Paris, France; Sorbonne Université, F-75013, Paris, France.
³ Institut du Cerveau et de la Moelle épinière, F-75013, Paris, France; Inserm, U1127, F-75013, Paris, France; CNRS, UMR 7225, F-75013, Paris, France; Sorbonne Université, F-75013, Paris, France; Inria, ARAMIS Project-team, F-75013, Paris, France.
⁴ Institut du Cerveau et de la Moelle épinière, F-75013, Paris, France; Inserm, U1127, F-75013, Paris, France; CNRS, UMR 7225, F-75013, Paris, France; Sorbonne Université, F-75013, Paris, France; Inria, ARAMIS Project-team, F-75013, Paris, France; AP-HP, Department of Neuroradiology, Pitié-Salpêtrière Hospital, Paris, France.
⁵ Laboratoire d'Imagerie Biomédicale, Inserm, U 1146, CNRS, UMR 7371, Sorbonne Université, F-75013, Paris, France.
⁶ Laboratoire d'Imagerie Biomédicale, Inserm, U 1146, CNRS, UMR 7371, Sorbonne Université, F-75013, Paris, France; AP-HP, Department of Nuclear Medicine, Pitié-Salpêtrière Hospital, Paris, France.
⁷ INSEAD, Bd de Constance, 77305, Fontainebleau, France.
⁸ Institut du Cerveau et de la Moelle épinière, F-75013, Paris, France; Inserm, U1127, F-75013, Paris, France; CNRS, UMR 7225, F-75013, Paris, France; Sorbonne Université, F-75013, Paris, France; Inria, ARAMIS Project-team, F-75013, Paris, France; AP-HP, Department of Neuroradiology, Pitié-Salpêtrière Hospital, Paris, France; AP-HP, Department of Neurology, Pitié-Salpêtrière Hospital, Paris, France. Electronic address: olivier.colliot@upmc.fr.

PMID: 30130647
DOI: 10.1016/j.neuroimage.2018.08.042

Abstract

A large number of papers have introduced novel machine learning and feature extraction methods for automatic classification of Alzheimer's disease (AD). However, while the vast majority of these works use the public dataset ADNI for evaluation, they are difficult to reproduce because different key components of the validation are often not readily available. These components include selected participants and input data, image preprocessing and cross-validation procedures. The performance of the different approaches is also difficult to compare objectively. In particular, it is often difficult to assess which part of the method (e.g. preprocessing, feature extraction or classification algorithms) provides a real improvement, if any. In the present paper, we propose a framework for reproducible and objective classification experiments in AD using three publicly available datasets (ADNI, AIBL and OASIS). The framework comprises: i) automatic conversion of the three datasets into a standard format (BIDS); ii) a modular set of preprocessing pipelines, feature extraction and classification methods, together with an evaluation framework, that provide a baseline for benchmarking the different components. We demonstrate the use of the framework for a large-scale evaluation on 1960 participants using T1 MRI and FDG PET data. In this evaluation, we assess the influence of different modalities, preprocessing, feature types (regional or voxel-based features), classifiers, training set sizes and datasets. Performances were in line with the state-of-the-art. FDG PET outperformed T1 MRI for all classification tasks. No difference in performance was found for the use of different atlases, image smoothing, partial volume correction of FDG PET images, or feature type. Linear SVM and L2-logistic regression resulted in similar performance and both outperformed random forests. The classification performance increased along with the number of subjects used for training. Classifiers trained on ADNI generalized well to AIBL and OASIS. All the code of the framework and the experiments is publicly available: general-purpose tools have been integrated into the Clinica software (www.clinica.run) and the paper-specific code is available at: https://gitlab.icm-institute.org/aramislab/AD-ML.

Keywords: Alzheimer's disease; Classification; Magnetic resonance imaging; Open-source; Positron emission tomography; Reproducibility.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Aged
Aged, 80 and over
Alzheimer Disease / diagnostic imaging*
Alzheimer Disease / metabolism
Alzheimer Disease / pathology
Atlases as Topic
Data Interpretation, Statistical*
Datasets as Topic*
Female
Fluorodeoxyglucose F18
Humans
Image Processing, Computer-Assisted / methods*
Machine Learning*
Magnetic Resonance Imaging / methods*
Male
Middle Aged
Neuroimaging / methods*
Positron-Emission Tomography / methods*
Radiopharmaceuticals

Substances

Radiopharmaceuticals
Fluorodeoxyglucose F18