TY - GEN
T1 - Co-occurring diseases heavily influence the performance of weakly supervised learning models for classification of chest CT
AU - Tushar, Fakrul Islam
AU - Danniballe, Vincent M.
AU - Rubin, Geoffrey D.
AU - Samei, Ehsan
AU - Lo, Joseph Y.
N1 - Publisher Copyright:
© 2022 SPIE.
PY - 2022
Y1 - 2022
N2 - Despite the potential of weakly supervised learning to automatically annotate massive amounts of data, little is known about its limitations for use in computer-aided diagnosis (CAD). For CT specifically, interpreting the performance of CAD algorithms can be challenging given the large number of co-occurring diseases. This paper examines the effect of co-occurring diseases when training classification models by weakly supervised learning, specifically by comparing multi-label and multiple binary classifiers using the same training data. Our results demonstrated that the binary model outperformed the multi-label classification in every disease category in terms of AUC. However, this performance was heavily influenced by co-occurring diseases in the binary model, suggesting it did not always learn the correct appearance of the specific disease. For example, binary classification of lung nodules resulted in an AUC of < 0.65 when there were no other co-occurring diseases, but when lung nodules cooccurred with emphysema, the performance reached AUC < 0.80. We hope this paper revealed the complexity of interpreting disease classification performance in weakly supervised models and will encourage researchers to examine the effect of co-occurring diseases on classification performance in the future.
AB - Despite the potential of weakly supervised learning to automatically annotate massive amounts of data, little is known about its limitations for use in computer-aided diagnosis (CAD). For CT specifically, interpreting the performance of CAD algorithms can be challenging given the large number of co-occurring diseases. This paper examines the effect of co-occurring diseases when training classification models by weakly supervised learning, specifically by comparing multi-label and multiple binary classifiers using the same training data. Our results demonstrated that the binary model outperformed the multi-label classification in every disease category in terms of AUC. However, this performance was heavily influenced by co-occurring diseases in the binary model, suggesting it did not always learn the correct appearance of the specific disease. For example, binary classification of lung nodules resulted in an AUC of < 0.65 when there were no other co-occurring diseases, but when lung nodules cooccurred with emphysema, the performance reached AUC < 0.80. We hope this paper revealed the complexity of interpreting disease classification performance in weakly supervised models and will encourage researchers to examine the effect of co-occurring diseases on classification performance in the future.
KW - CT
KW - binary classifier
KW - classification
KW - co-occurring diseases
KW - multi-label classifier
KW - weak-supervision
UR - http://www.scopus.com/inward/record.url?scp=85132817166&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85132817166&partnerID=8YFLogxK
U2 - 10.1117/12.2612700
DO - 10.1117/12.2612700
M3 - Conference contribution
AN - SCOPUS:85132817166
T3 - Progress in Biomedical Optics and Imaging - Proceedings of SPIE
BT - Medical Imaging 2022
A2 - Drukker, Karen
A2 - Iftekharuddin, Khan M.
PB - SPIE
T2 - Medical Imaging 2022: Computer-Aided Diagnosis
Y2 - 21 March 2022 through 27 March 2022
ER -