TY - JOUR
T1 - A distributional reinforcement learning model for optimal glucose control after cardiac surgery
AU - Desman, Jacob M.
AU - Hong, Zhang Wei
AU - Sabounchi, Moein
AU - Sawant, Ashwin S.
AU - Gill, Jaskirat
AU - Costa, Ana C.
AU - Kumar, Gagan
AU - Sharma, Rajeev
AU - Gupta, Arpeta
AU - McCarthy, Paul
AU - Nandwani, Veena
AU - Powell, Doug
AU - Carideo, Alexandra
AU - Goodwin, Donnie
AU - Ahmed, Sanam
AU - Gidwani, Umesh
AU - Levin, Matthew A.
AU - Varghese, Robin
AU - Filsoufi, Farzan
AU - Freeman, Robert
AU - Shetreat-Klein, Avniel
AU - Charney, Alexander W.
AU - Hofer, Ira
AU - Chan, Lili
AU - Reich, David
AU - Kovatch, Patricia
AU - Kohli-Seth, Roopa
AU - Kraft, Monica
AU - Agrawal, Pulkit
AU - Kellum, John A.
AU - Nadkarni, Girish N.
AU - Sakhuja, Ankit
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/12
Y1 - 2025/12
N2 - This study introduces Glucose Level Understanding and Control Optimized for Safety and Efficacy (GLUCOSE), a distributional offline reinforcement learning algorithm for optimizing insulin dosing after cardiac surgery. Trained on 5228 patients, tested on 920, and externally validated on 649, GLUCOSE achieved a mean estimated reward of 0.0 [–0.07, 0.06] in internal testing and –0.63 [–0.74, –0.52] in external validation, outperforming clinician returns of –1.29 [–1.37, –1.20] and –1.02 [–1.16, –0.89]. In multi-phase human validation, GLUCOSE first showed a significantly lower mean absolute error (MAE) in insulin dosing, with 0.9 units MAE versus clinicians’ 1.97 units (p < 0.001) in internal testing and 1.90 versus 2.24 units (p = 0.003) in external validation. The second and third phases found GLUCOSE’s performance as comparable to or exceeding that of senior clinicians in MAE, safety, effectiveness, and acceptability. These findings suggest GLUCOSE as a robust tool for improving postoperative glucose management.
AB - This study introduces Glucose Level Understanding and Control Optimized for Safety and Efficacy (GLUCOSE), a distributional offline reinforcement learning algorithm for optimizing insulin dosing after cardiac surgery. Trained on 5228 patients, tested on 920, and externally validated on 649, GLUCOSE achieved a mean estimated reward of 0.0 [–0.07, 0.06] in internal testing and –0.63 [–0.74, –0.52] in external validation, outperforming clinician returns of –1.29 [–1.37, –1.20] and –1.02 [–1.16, –0.89]. In multi-phase human validation, GLUCOSE first showed a significantly lower mean absolute error (MAE) in insulin dosing, with 0.9 units MAE versus clinicians’ 1.97 units (p < 0.001) in internal testing and 1.90 versus 2.24 units (p = 0.003) in external validation. The second and third phases found GLUCOSE’s performance as comparable to or exceeding that of senior clinicians in MAE, safety, effectiveness, and acceptability. These findings suggest GLUCOSE as a robust tool for improving postoperative glucose management.
UR - https://www.scopus.com/pages/publications/105006921885
UR - https://www.scopus.com/pages/publications/105006921885#tab=citedBy
U2 - 10.1038/s41746-025-01709-9
DO - 10.1038/s41746-025-01709-9
M3 - Article
AN - SCOPUS:105006921885
SN - 2398-6352
VL - 8
JO - npj Digital Medicine
JF - npj Digital Medicine
IS - 1
M1 - 313
ER -