TY - GEN
T1 - Predicting Side Effects of Chemotherapy
T2 - 6th IEEE Eurasia Conference on Biomedical Engineering, Healthcare and Sustainability, ECBIOS 2024
AU - Finkelstein, Joseph
AU - Smiley, Aref
AU - Huo, Xingyue
AU - Echeverria, Christina
AU - Mooney, Kathi
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - This study investigates the potential of machine learning (ML) to predict symptom escalation in chemotherapy patients using daily self-reported data. We analyzed self-reported health information from 339 patients, tracking 12 symptoms quantified by severity and distress to compile a comprehensive symptom score ranging from 0 to 230. To address the inherent challenge of dataset imbalance, we employed a stratified sampling technique. This involved dividing the dataset into several subgroups based on the severity of symptoms, and then randomly selecting an equal number of instances from each subgroup. This created a balanced dataset, thereby enhancing the reliability and robustness of our predictive models. We rigorously tested nine diverse ML models, including decision trees, discriminant analysis, and support vector machines (SVM), to predict changes in the total symptom score based on data from the preceding 3 to 5 days. These models were trained on the balanced dataset to mitigate the effects of the original data imbalance, and performance was also compared on the unbalanced dataset to evaluate the impact of data preparation on model efficacy. Our analysis highlighted the SVM classifiers' high accuracy rate of 82% on the unbalanced dataset, though they often misclassified severe symptom changes as less severe. In contrast, the Ensemble algorithm augmented with the RUSBoost classifier demonstrated exceptional precision across both datasets. It achieved accuracy rates of 61.16%, 58.41%, and 60.05% for datasets spanning 3, 4, and 5 days, respectively. This model effectively identified significant symptom changes, emphasizing the critical role of balanced datasets in ML training within health care contexts, where precision significantly affects patient care and outcomes. This research demonstrates the powerful capabilities of ML in cancer care, advocating for advanced algorithmic approaches and the optimized use of balanced datasets to enhance the effectiveness and impact of patient care strategies in cancer symptom management.
AB - This study investigates the potential of machine learning (ML) to predict symptom escalation in chemotherapy patients using daily self-reported data. We analyzed self-reported health information from 339 patients, tracking 12 symptoms quantified by severity and distress to compile a comprehensive symptom score ranging from 0 to 230. To address the inherent challenge of dataset imbalance, we employed a stratified sampling technique. This involved dividing the dataset into several subgroups based on the severity of symptoms, and then randomly selecting an equal number of instances from each subgroup. This created a balanced dataset, thereby enhancing the reliability and robustness of our predictive models. We rigorously tested nine diverse ML models, including decision trees, discriminant analysis, and support vector machines (SVM), to predict changes in the total symptom score based on data from the preceding 3 to 5 days. These models were trained on the balanced dataset to mitigate the effects of the original data imbalance, and performance was also compared on the unbalanced dataset to evaluate the impact of data preparation on model efficacy. Our analysis highlighted the SVM classifiers' high accuracy rate of 82% on the unbalanced dataset, though they often misclassified severe symptom changes as less severe. In contrast, the Ensemble algorithm augmented with the RUSBoost classifier demonstrated exceptional precision across both datasets. It achieved accuracy rates of 61.16%, 58.41%, and 60.05% for datasets spanning 3, 4, and 5 days, respectively. This model effectively identified significant symptom changes, emphasizing the critical role of balanced datasets in ML training within health care contexts, where precision significantly affects patient care and outcomes. This research demonstrates the powerful capabilities of ML in cancer care, advocating for advanced algorithmic approaches and the optimized use of balanced datasets to enhance the effectiveness and impact of patient care strategies in cancer symptom management.
KW - Bioengineering
KW - Machine Learning
KW - Signal Processing & Analysis
UR - https://www.scopus.com/pages/publications/105000192645
UR - https://www.scopus.com/pages/publications/105000192645#tab=citedBy
U2 - 10.1109/ECBIOS61468.2024.10883761
DO - 10.1109/ECBIOS61468.2024.10883761
M3 - Conference contribution
AN - SCOPUS:105000192645
T3 - Proceedings of the 2024 IEEE 6th Eurasia Conference on Biomedical Engineering, Healthcare and Sustainability, ECBIOS 2024
SP - 108
EP - 111
BT - Proceedings of the 2024 IEEE 6th Eurasia Conference on Biomedical Engineering, Healthcare and Sustainability, ECBIOS 2024
A2 - Meen, Teen-Hang
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 14 June 2024 through 16 June 2024
ER -