Background: Managing blood glucose levels in Type 1 Diabetes Mellitus (T1DM) is essential to prevent complications. Traditional insulin delivery methods often require significant patient involvement, limiting automation. Reinforcement Learning (RL)-based controllers offer a promising approach for improving automated insulin administration. Methods: We propose a Dual Proximal Policy Optimization (Dual PPO) controller for personalized insulin delivery in a hybrid closed-loop system. The controller optimizes patient-specific insulin bounds through a grid search on pre-trained models to manage both hyperglycemia and hypoglycemia. A safe-control mechanism prevents insulin administration when glucose levels drop below a predefined threshold. The system was evaluated on 10 in silico adult patients using the UVA/Padova simulator, with five-day randomized meal scenarios. Results: The Dual PPO controller significantly improved Time in Range (TIR) (69.30% ±1.61) compared to a single PPO model (61.69% ±1.54). The system effectively reduced severe hyperglycemia while maintaining a low incidence of severe hypoglycemia. Unlike conventional open-loop methods such as Basal-Bolus (BBC) and Proportional-Integral-Derivative (PIDC) controllers, our system requires minimal patient interaction, eliminating the need for carbohydrate estimation. Conclusions: The Dual PPO controller enhances personalized insulin delivery in T1DM, improving glycemic control while reducing patient burden. This approach advances precision medicine in diabetes management, with potential for future real-world applications.

Deep reinforcement learning for Type 1 Diabetes: Dual PPO controller for personalized insulin management

Marchetti A.;Matarrese M. A. G.;Merone M.
2025-01-01

Abstract

Background: Managing blood glucose levels in Type 1 Diabetes Mellitus (T1DM) is essential to prevent complications. Traditional insulin delivery methods often require significant patient involvement, limiting automation. Reinforcement Learning (RL)-based controllers offer a promising approach for improving automated insulin administration. Methods: We propose a Dual Proximal Policy Optimization (Dual PPO) controller for personalized insulin delivery in a hybrid closed-loop system. The controller optimizes patient-specific insulin bounds through a grid search on pre-trained models to manage both hyperglycemia and hypoglycemia. A safe-control mechanism prevents insulin administration when glucose levels drop below a predefined threshold. The system was evaluated on 10 in silico adult patients using the UVA/Padova simulator, with five-day randomized meal scenarios. Results: The Dual PPO controller significantly improved Time in Range (TIR) (69.30% ±1.61) compared to a single PPO model (61.69% ±1.54). The system effectively reduced severe hyperglycemia while maintaining a low incidence of severe hypoglycemia. Unlike conventional open-loop methods such as Basal-Bolus (BBC) and Proportional-Integral-Derivative (PIDC) controllers, our system requires minimal patient interaction, eliminating the need for carbohydrate estimation. Conclusions: The Dual PPO controller enhances personalized insulin delivery in T1DM, improving glycemic control while reducing patient burden. This approach advances precision medicine in diabetes management, with potential for future real-world applications.
2025
Diabetes; Glycemic control; Precision medicine; Proximal Policy Optimization; Reinforcement learning
File in questo prodotto:
File Dimensione Formato  
1-s2.0-S0010482525004986-main.pdf

accesso aperto

Tipologia: Versione Editoriale (PDF)
Licenza: Creative commons
Dimensione 3.21 MB
Formato Adobe PDF
3.21 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.12610/87923
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact