DiaData: an integrated large dataset for type 1 diabetes and hypoglycemia research
Translated title
DiaData: Ein integrierter großer Datensatz für die Forschung zu Typ-1-Diabetes und Hypoglykämie
Publication date
2025-06-03
Document type
Research data
Author
Organisational unit
Publisher
Universitätsbibliothek der HSU/UniBw H
Part of the university bibliography
✅
Files openHSU_20048_3 (6.38 MB) openHSU_20048_2 (380.96 MB)
subdatabaseII.csv.zip
subdatabaseI.csv.zip
Language
English
Keyword
DiaData
Type 1 diabetes
Dataset
CGM
Abstract
DiaData integrates 13 different datasets and presents a large continuous glucose monitoring (CGM) dataset comprising data from individuals with Type 1 Diabetes (T1D) across various age groups. The Maindatabase contains CGM measurements of all 1720 subjects. From this, two subsets are extracted: Subdatabase I includes CGM data and demographics of age and sex for 1306 subjects, while Subdatabase II includes CGM and heart rate data for a subset of 51 subjects.
Description
DiaData is an integrated dataset that combines Continuous Glucose Monitoring (CGM) measurements from 13 different datasets, all collected from patients with Type 1 Diabetes (T1D). DiaData is provided in .csv format, where each row represents a single CGM measurement. The Maindatabase includes CGM data for all subjects, with the following columns: timestamp (ts), patient identifier (PtID), glucose value (GlucoseCGM), and the source database name (Database). Subdatabase I adds demographic information, including Age, AgeGroup, and Sex, for a subset of subjects. Subdatabase II contains CGM data combined with heart rate measurements for a subset of subjects, incorporating an additional column for heart rate (HR).
The datasets used in this study were obtained from a variety of third-party sources. The sources of datasets, the code for data preprocessing and exploration can be found in https://github.com/Beyza-Cinar/DiaData.
The datasets used in this study were obtained from a variety of third-party sources. The sources of datasets, the code for data preprocessing and exploration can be found in https://github.com/Beyza-Cinar/DiaData.
The sources of the data are
the D1NAMO dataset (https://doi.org/10.5281/zenodo.5651217),
the HUPA-UCM Diabetes Dataset (doi: 10.17632/3hbcscwz44.1),
the Diabetes Adolescents Time Series with Heart Rate dataset (https://github.com/ictinnovaties-zorg/dataset-diabetes-adolescents-time-series-with-heart-rate/tree/main/data-csv),
the ShanghaiT1DM dataset (https://doi.org/10.6084/m9.figshare.20444397.v3),
the T1GDUJA dataset (https://doi.org/10.5281/zenodo.11284018),
the CITY dataset (https://public.jaeb.org/dataset/565),
the ReplaceBG dataset (https://public.jaeb.org/dataset/546),
the RT-CGM dataset (https://public.jaeb.org/dataset/563),
the DLCP3 dataset (https://public.jaeb.org/dataset/573),
the SENCE dataset (https://public.jaeb.org/dataset/537),
the Severe Hypoglycemia in Older Adults with Type 1 Diabetes dataset (https://public.jaeb.org/dataset/537),
the WISDM dataset (https://public.jaeb.org/dataset/564),
and the PEDAP dataset (https://public.jaeb.org/dataset/599).
The sources of subsets of the data are the Barbara Davis Center, Jaeb Center for Health Research, Joslin Diabetes Center, T1D Exchange, University of Colorado, and University of Virginia. The analyses, content, and conclusions presented herein are solely the responsibility of the authors and have not been reviewed or approved by the before mentioned institutions.
the D1NAMO dataset (https://doi.org/10.5281/zenodo.5651217),
the HUPA-UCM Diabetes Dataset (doi: 10.17632/3hbcscwz44.1),
the Diabetes Adolescents Time Series with Heart Rate dataset (https://github.com/ictinnovaties-zorg/dataset-diabetes-adolescents-time-series-with-heart-rate/tree/main/data-csv),
the ShanghaiT1DM dataset (https://doi.org/10.6084/m9.figshare.20444397.v3),
the T1GDUJA dataset (https://doi.org/10.5281/zenodo.11284018),
the CITY dataset (https://public.jaeb.org/dataset/565),
the ReplaceBG dataset (https://public.jaeb.org/dataset/546),
the RT-CGM dataset (https://public.jaeb.org/dataset/563),
the DLCP3 dataset (https://public.jaeb.org/dataset/573),
the SENCE dataset (https://public.jaeb.org/dataset/537),
the Severe Hypoglycemia in Older Adults with Type 1 Diabetes dataset (https://public.jaeb.org/dataset/537),
the WISDM dataset (https://public.jaeb.org/dataset/564),
and the PEDAP dataset (https://public.jaeb.org/dataset/599).
The sources of subsets of the data are the Barbara Davis Center, Jaeb Center for Health Research, Joslin Diabetes Center, T1D Exchange, University of Colorado, and University of Virginia. The analyses, content, and conclusions presented herein are solely the responsibility of the authors and have not been reviewed or approved by the before mentioned institutions.
Version
Not applicable (or unknown)
Access right on openHSU
Open access