EMS3D-KITTI: Synthetic 3D dataset in KITTI format with a fair distribution of Emergency Medical Services vehicles for autodrive AI model training

Data Brief. 2024 Dec 11:58:111221. doi: 10.1016/j.dib.2024.111221. eCollection 2025 Feb.

Abstract

Contemporary research in 3D object detection for autonomous driving primarily focuses on identifying standard entities like vehicles and pedestrians. However, the need for large, precisely labelled datasets limits the detection of specialized and less common objects, such as Emergency Medical Service (EMS) and law enforcement vehicles. To address this, we leveraged the Car Learning to Act (CARLA) simulator to generate and fairly distribute rare EMS vehicles, automatically labelling these objects in 3D point cloud data. This enriched dataset, organized in the KITTI 3D object detection benchmark format by the Karlsruhe Institute of Technology and the Toyota Technological Institute, improves its utility for training and evaluating autonomous vehicle systems. To bridge the gap between simulated and real-world scenarios, our methodology integrates a wide range of scenarios simulation in CARLA, including variations in weather conditions, human presence, and different environmental settings. This approach enhances the realism and robustness of the dataset, making it more applicable to practical autonomous driving scenarios. The data provided in this article offers a valuable resource for researchers, industry professionals, and stakeholders interested in advancing autonomous vehicle technologies and improving emergency vehicle detection. Furthermore, this dataset contributes to broader efforts in road safety and the development of AI systems capable of handling specialized vehicle identification in real-world applications.

Keywords: Autonomous vehicles; CARLA simulator; Deep learning; Machine learning; Point cloud; Traffic scenarios.