Data Element Mapping in the Data Privacy Era

Romain Griffier; Sébastien Cossin; François Konschelle; Fleur Mougin; Vianney Jouhet

doi:10.3233/SHTI220469

Data Element Mapping in the Data Privacy Era

Stud Health Technol Inform. 2022 May 25:294:332-336. doi: 10.3233/SHTI220469.

Authors

Romain Griffier^{1

2}, Sébastien Cossin^{1

2}, François Konschelle^{1

2}, Fleur Mougin², Vianney Jouhet^{1

2}

Affiliations

¹ Bordeaux University Hospital, Public health, 33000 Bordeaux, France.
² Bordeaux University, Inserm U1219, Bordeaux Population Health, ERIAS team, 33000 Bordeaux, France.

PMID: 35612087
DOI: 10.3233/SHTI220469

Abstract

Secondary use of health data is made difficult in part because of large semantic heterogeneity. Many efforts are being made to align local terminologies with international standards. With increasing concerns about data privacy, we focused here on the use of machine learning methods to align biological data elements using aggregated features that could be shared as open data. A 3-step methodology (features engineering, blocking strategy and supervised learning) was proposed. The first results, although modest, are encouraging for the future development of this approach.

Keywords: LOINC; data element; machine learning; mapping.

MeSH terms

Machine Learning*
Privacy*