openHSU logo
Log In(current)
  1. Home
  2. Helmut-Schmidt-University / University of the Federal Armed Forces Hamburg
  3. Publications
  4. 3 - Publication references (without full text)
  5. A review on machine learning approaches for the prediction of glucose levels and hypoglycemia

A review on machine learning approaches for the prediction of glucose levels and hypoglycemia

Publication date
2026-01-09
Document type
Preprint
Author
Cinar, Beyza  
van den Boom, Louisa
Maleshkova, Maria  
Organisational unit
Data Engineering  
DOI
10.48550/arXiv.2601.11615
URI
https://openhsu.ub.hsu-hh.de/handle/10.24405/22145
Publisher
arXiv
Part of the university bibliography
✅
Additional Information
Language
English
Keyword
Classification
Diabetes type 1
Hypoglycemia
Machine learning
Long-term
Short-term
Regression
Abstract
Type 1 Diabetes (T1D) is an autoimmune disease leading to insulin insufficiency. Thus, patients require lifelong insulin therapy, which has a side effect of hypoglycemia. Hypoglycemia is a critical state of decreased blood glucose levels (BGL) below 70 mg/dL and is associated with increased risk of mortality. Machine learning (ML) models can improve diabetes management by predicting hypoglycemia and providing optimal prevention methods. ML models are classified into regression and classification based, that forecast glucose levels and identify events based on defined labels, respectively. This review investigates state-of-the-art models trained on data of continuous glucose monitoring (CGM) devices from patients with T1D. We compare the models' performance across short-term (15 to 120 min) and long term (3 to more than 24 hours) prediction horizons (PHs). Particularly, we explore: 1) How much in advance can glucose values or a hypoglycemic event be accurately predicted? 2) Which models have the best performance? 3) Which factors impact the performance? and 4) Does personalization increase performance? The results show that 1) a PH of up to 1 hour provides the best results. 2) Conventional ML methods yield the best results for classification and DL for regression. A single model cannot adequately classify across multiple PHs. 3) The model performance is influenced by multivariate datasets and the input sequence length (ISL). 4) Personal data enhances performance but due to limited data quality population-based models are preferred.
Version
Submitted version under review
Access right on openHSU
Metadata only access

  • Privacy policy
  • Send Feedback
  • Imprint