TY - JOUR
T1 - Towards Outcome-Driven Patient Subgroups
T2 - A Machine Learning Analysis Across Six Depression Treatment Studies
AU - Benrimoh, David
AU - Kleinerman, Akiva
AU - Furukawa, Toshi A.
AU - III, Charles F.Reynolds
AU - Lenze, Eric J.
AU - Karp, Jordan
AU - Mulsant, Benoit
AU - Armstrong, Caitrin
AU - Mehltretter, Joseph
AU - Fratila, Robert
AU - Perlman, Kelly
AU - Israel, Sonia
AU - Popescu, Christina
AU - Golden, Grace
AU - Qassim, Sabrina
AU - Anacleto, Alexandra
AU - Tanguay-Sela, Myriam
AU - Kapelner, Adam
AU - Rosenfeld, Ariel
AU - Turecki, Gustavo
N1 - Publisher Copyright:
© 2023 American Association for Geriatric Psychiatry
PY - 2024/3
Y1 - 2024/3
N2 - Background: Major depressive disorder (MDD) is a heterogeneous condition; multiple underlying neurobiological and behavioral substrates are associated with treatment response variability. Understanding the sources of this variability and predicting outcomes has been elusive. Machine learning (ML) shows promise in predicting treatment response in MDD, but its application is limited by challenges to the clinical interpretability of ML models, and clinicians often lack confidence in model results. In order to improve the interpretability of ML models in clinical practice, our goal was to demonstrate the derivation of treatment-relevant patient profiles comprised of clinical and demographic information using a novel ML approach. Methods: We analyzed data from six clinical trials of pharmacological treatment for depression (total n = 5438) using the Differential Prototypes Neural Network (DPNN), a ML model that derives patient prototypes which can be used to derive treatment-relevant patient clusters while learning to generate probabilities for differential treatment response. A model classifying remission and outputting individual remission probabilities for five first-line monotherapies and three combination treatments was trained using clinical and demographic data. Prototypes were evaluated for interpretability by assessing differences in feature distributions (e.g. age, sex, symptom severity) and treatment-specific outcomes. Results: A 3-prototype model achieved an area under the receiver operating curve of 0.66 and an expected absolute improvement in remission rate for those receiving the best predicted treatment of 6.5% (relative improvement of 15.6%) compared to the population remission rate. We identified three treatment-relevant patient clusters. Cluster A patients tended to be younger, to have increased levels of fatigue, and more severe symptoms. Cluster B patients tended to be older, female, have less severe symptoms, and the highest remission rates. Cluster C patients had more severe symptoms, lower remission rates, more psychomotor agitation, more intense suicidal ideation, and more somatic genital symptoms. Conclusion: It is possible to produce novel treatment-relevant patient profiles using ML models; doing so may improve interpretability of ML models and the quality of precision medicine treatments for MDD.
AB - Background: Major depressive disorder (MDD) is a heterogeneous condition; multiple underlying neurobiological and behavioral substrates are associated with treatment response variability. Understanding the sources of this variability and predicting outcomes has been elusive. Machine learning (ML) shows promise in predicting treatment response in MDD, but its application is limited by challenges to the clinical interpretability of ML models, and clinicians often lack confidence in model results. In order to improve the interpretability of ML models in clinical practice, our goal was to demonstrate the derivation of treatment-relevant patient profiles comprised of clinical and demographic information using a novel ML approach. Methods: We analyzed data from six clinical trials of pharmacological treatment for depression (total n = 5438) using the Differential Prototypes Neural Network (DPNN), a ML model that derives patient prototypes which can be used to derive treatment-relevant patient clusters while learning to generate probabilities for differential treatment response. A model classifying remission and outputting individual remission probabilities for five first-line monotherapies and three combination treatments was trained using clinical and demographic data. Prototypes were evaluated for interpretability by assessing differences in feature distributions (e.g. age, sex, symptom severity) and treatment-specific outcomes. Results: A 3-prototype model achieved an area under the receiver operating curve of 0.66 and an expected absolute improvement in remission rate for those receiving the best predicted treatment of 6.5% (relative improvement of 15.6%) compared to the population remission rate. We identified three treatment-relevant patient clusters. Cluster A patients tended to be younger, to have increased levels of fatigue, and more severe symptoms. Cluster B patients tended to be older, female, have less severe symptoms, and the highest remission rates. Cluster C patients had more severe symptoms, lower remission rates, more psychomotor agitation, more intense suicidal ideation, and more somatic genital symptoms. Conclusion: It is possible to produce novel treatment-relevant patient profiles using ML models; doing so may improve interpretability of ML models and the quality of precision medicine treatments for MDD.
KW - artificial intelligence
KW - machine learning
KW - major depression
KW - subgroups
UR - http://www.scopus.com/inward/record.url?scp=85174045952&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85174045952&partnerID=8YFLogxK
U2 - 10.1016/j.jagp.2023.09.009
DO - 10.1016/j.jagp.2023.09.009
M3 - Article
C2 - 37839909
AN - SCOPUS:85174045952
SN - 1064-7481
VL - 32
SP - 280
EP - 292
JO - American Journal of Geriatric Psychiatry
JF - American Journal of Geriatric Psychiatry
IS - 3
ER -