A distributional reinforcement learning model for optimal glucose control after cardiac surgery

  • Jacob M. Desman
  • , Zhang Wei Hong
  • , Moein Sabounchi
  • , Ashwin S. Sawant
  • , Jaskirat Gill
  • , Ana C. Costa
  • , Gagan Kumar
  • , Rajeev Sharma
  • , Arpeta Gupta
  • , Paul McCarthy
  • , Veena Nandwani
  • , Doug Powell
  • , Alexandra Carideo
  • , Donnie Goodwin
  • , Sanam Ahmed
  • , Umesh Gidwani
  • , Matthew A. Levin
  • , Robin Varghese
  • , Farzan Filsoufi
  • , Robert Freeman
  • Avniel Shetreat-Klein, Alexander W. Charney, Ira Hofer, Lili Chan, David Reich, Patricia Kovatch, Roopa Kohli-Seth, Monica Kraft, Pulkit Agrawal, John A. Kellum, Girish N. Nadkarni, Ankit Sakhuja

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

This study introduces Glucose Level Understanding and Control Optimized for Safety and Efficacy (GLUCOSE), a distributional offline reinforcement learning algorithm for optimizing insulin dosing after cardiac surgery. Trained on 5228 patients, tested on 920, and externally validated on 649, GLUCOSE achieved a mean estimated reward of 0.0 [–0.07, 0.06] in internal testing and –0.63 [–0.74, –0.52] in external validation, outperforming clinician returns of –1.29 [–1.37, –1.20] and –1.02 [–1.16, –0.89]. In multi-phase human validation, GLUCOSE first showed a significantly lower mean absolute error (MAE) in insulin dosing, with 0.9 units MAE versus clinicians’ 1.97 units (p < 0.001) in internal testing and 1.90 versus 2.24 units (p = 0.003) in external validation. The second and third phases found GLUCOSE’s performance as comparable to or exceeding that of senior clinicians in MAE, safety, effectiveness, and acceptability. These findings suggest GLUCOSE as a robust tool for improving postoperative glucose management.

Original languageEnglish (US)
Article number313
Journalnpj Digital Medicine
Volume8
Issue number1
DOIs
StatePublished - Dec 2025
Externally publishedYes

ASJC Scopus subject areas

  • Medicine (miscellaneous)
  • Health Informatics
  • Computer Science Applications
  • Health Information Management

Fingerprint

Dive into the research topics of 'A distributional reinforcement learning model for optimal glucose control after cardiac surgery'. Together they form a unique fingerprint.

Cite this