Research in severe traumatic brain injury (TBI) has historically been limited by studies with relatively small sample sizes that result in low power to detect small, yet clinically meaningful outcomes. Data sharing and integration from existing sources hold promise to yield larger more robust sample sizes that improve the potential signal and generalizability of important research questions. However, curation and harmonization of data of different types and of disparate provenance is challenging. We report our approach and experience integrating multiple TBI data sets containing collected physiological data, including both expected and unexpected challenges encountered in the integration process. Our harmonized data set included data on 1536 patients from the Citicoline Brain Injury Treatment Trial (COBRIT), Effect of erythropoietin and transfusion threshold on neurological recovery after traumatic brain injury: a randomized clinical trial (EPO Severe TBI), BEST-TRIP, Progesterone for the Treatment of Traumatic Brain Injury III Clinical Trial (ProTECT III), Transforming Research and Clinical Knowledge in Traumatic brain Injury (TRACK-TBI), Brain Oxygen Optimization in Severe Traumatic Brain Injury Phase-II (BOOST-2), and Ben Taub General Hospital (BTGH) Research Database studies. We conclude with process recommendations for data acquisition for future prospective studies to aid integration of these data with existing studies. These recommendations include using common data elements whenever possible, a standardized recording system for labeling and timing of high-frequency physiological data, and secondary use of studies in systems such as Federal Interagency Traumatic Brain Injury Research Informatics System (FITBIR), to engage investigators who collected the original data.
Keywords: FITBIR; data curation; data harmonization; data integration; physiological data; traumatic brain injury.