TY - JOUR
T1 - Interpretable and Reliable Oral Cancer Classifier with Attention Mechanism and Expert Knowledge Embedding via Attention Map
AU - Song, Bofan
AU - Zhang, Chicheng
AU - Sunny, Sumsum
AU - KC, Dharma Raj
AU - Li, Shaobai
AU - Gurushanth, Keerthi
AU - Mendonca, Pramila
AU - Mukhia, Nirza
AU - Patrick, Sanjana
AU - Gurudath, Shubha
AU - Raghavan, Subhashini
AU - Tsusennaro, Imchen
AU - Leivon, Shirley T.
AU - Kolur, Trupti
AU - Shetty, Vivek
AU - Bushan, Vidya
AU - Ramesh, Rohan
AU - Pillai, Vijay
AU - Wilder-Smith, Petra
AU - Suresh, Amritha
AU - Kuriakose, Moni Abraham
AU - Birur, Praveen
AU - Liang, Rongguang
N1 - Publisher Copyright:
© 2023 by the authors.
PY - 2023/3
Y1 - 2023/3
N2 - Convolutional neural networks have demonstrated excellent performance in oral cancer detection and classification. However, the end-to-end learning strategy makes CNNs hard to interpret, and it can be challenging to fully understand the decision-making procedure. Additionally, reliability is also a significant challenge for CNN based approaches. In this study, we proposed a neural network called the attention branch network (ABN), which combines the visual explanation and attention mechanisms to improve the recognition performance and interpret the decision-making simultaneously. We also embedded expert knowledge into the network by having human experts manually edit the attention maps for the attention mechanism. Our experiments have shown that ABN performs better than the original baseline network. By introducing the Squeeze-and-Excitation (SE) blocks to the network, the cross-validation accuracy increased further. Furthermore, we observed that some previously misclassified cases were correctly recognized after updating by manually editing the attention maps. The cross-validation accuracy increased from 0.846 to 0.875 with the ABN (Resnet18 as baseline), 0.877 with SE-ABN, and 0.903 after embedding expert knowledge. The proposed method provides an accurate, interpretable, and reliable oral cancer computer-aided diagnosis system through visual explanation, attention mechanisms, and expert knowledge embedding.
AB - Convolutional neural networks have demonstrated excellent performance in oral cancer detection and classification. However, the end-to-end learning strategy makes CNNs hard to interpret, and it can be challenging to fully understand the decision-making procedure. Additionally, reliability is also a significant challenge for CNN based approaches. In this study, we proposed a neural network called the attention branch network (ABN), which combines the visual explanation and attention mechanisms to improve the recognition performance and interpret the decision-making simultaneously. We also embedded expert knowledge into the network by having human experts manually edit the attention maps for the attention mechanism. Our experiments have shown that ABN performs better than the original baseline network. By introducing the Squeeze-and-Excitation (SE) blocks to the network, the cross-validation accuracy increased further. Furthermore, we observed that some previously misclassified cases were correctly recognized after updating by manually editing the attention maps. The cross-validation accuracy increased from 0.846 to 0.875 with the ABN (Resnet18 as baseline), 0.877 with SE-ABN, and 0.903 after embedding expert knowledge. The proposed method provides an accurate, interpretable, and reliable oral cancer computer-aided diagnosis system through visual explanation, attention mechanisms, and expert knowledge embedding.
KW - attention branch network
KW - attention map
KW - attention mechanism
KW - expert knowledge embedding
KW - human-in-the-loop deep learning
KW - visual explanation
UR - http://www.scopus.com/inward/record.url?scp=85149843295&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85149843295&partnerID=8YFLogxK
U2 - 10.3390/cancers15051421
DO - 10.3390/cancers15051421
M3 - Article
AN - SCOPUS:85149843295
SN - 2072-6694
VL - 15
JO - Cancers
JF - Cancers
IS - 5
M1 - 1421
ER -