TY - JOUR
T1 - Exploring uncertainty measures in convolutional neural network for semantic segmentation of oral cancer images
AU - Song, Bofan
AU - Li, Shaobai
AU - Sunny, Sumsum
AU - Gurushanth, Keerthi
AU - Mendonca, Pramila
AU - Mukhia, Nirza
AU - Patrick, Sanjana
AU - Peterson, Tyler
AU - Gurudath, Shubha
AU - Raghavan, Subhashini
AU - Tsusennaro, Imchen
AU - Leivon, Shirley T.
AU - Kolur, Trupti
AU - Shetty, Vivek
AU - Bushan, Vidya
AU - Ramesh, Rohan
AU - Pillai, Vijay
AU - Wilder-Smith, Petra
AU - Suresh, Amritha
AU - Kuriakose, Moni Abraham
AU - Birur, Praveen
AU - Liang, Rongguang
N1 - Publisher Copyright:
© 2022 The Authors. Published by SPIE under a Creative Commons Attribution 4.0 International License. Distribution or reproduction of this work in whole or in part requires full attribution of the original publication, including its DOI.
PY - 2022/11/1
Y1 - 2022/11/1
N2 - Significance: Oral cancer is one of the most prevalent cancers, especially in middle- and low-income countries such as India. Automatic segmentation of oral cancer images can improve the diagnostic workflow, which is a significant task in oral cancer image analysis. Despite the remarkable success of deep-learning networks in medical segmentation, they rarely provide uncertainty quantification for their output. Aim: We aim to estimate uncertainty in a deep-learning approach to semantic segmentation of oral cancer images and to improve the accuracy and reliability of predictions. Approach: This work introduced a UNet-based Bayesian deep-learning (BDL) model to segment potentially malignant and malignant lesion areas in the oral cavity. The model can quantify uncertainty in predictions.We also developed an efficient model that increased the inference speed, which is almost six times smaller and two times faster (inference speed) than the original UNet. The dataset in this study was collected using our customized screening platform and was annotated by oral oncology specialists. Results: The proposed approach achieved good segmentation performance as well as good uncertainty estimation performance. In the experiments, we observed an improvement in pixel accuracy and mean intersection over union by removing uncertain pixels. This result reflects that the model provided less accurate predictions in uncertain areas that may need more attention and further inspection. The experiments also showed that with some performance compromises, the efficient model reduced computation time and model size, which expands the potential for implementation on portable devices used in resource-limited settings. Conclusions: Our study demonstrates the UNet-based BDL model not only can perform potentially malignant and malignant oral lesion segmentation, but also can provide informative pixel-level uncertainty estimation. With this extra uncertainty information, the accuracy and reliability of the model's prediction can be improved.
AB - Significance: Oral cancer is one of the most prevalent cancers, especially in middle- and low-income countries such as India. Automatic segmentation of oral cancer images can improve the diagnostic workflow, which is a significant task in oral cancer image analysis. Despite the remarkable success of deep-learning networks in medical segmentation, they rarely provide uncertainty quantification for their output. Aim: We aim to estimate uncertainty in a deep-learning approach to semantic segmentation of oral cancer images and to improve the accuracy and reliability of predictions. Approach: This work introduced a UNet-based Bayesian deep-learning (BDL) model to segment potentially malignant and malignant lesion areas in the oral cavity. The model can quantify uncertainty in predictions.We also developed an efficient model that increased the inference speed, which is almost six times smaller and two times faster (inference speed) than the original UNet. The dataset in this study was collected using our customized screening platform and was annotated by oral oncology specialists. Results: The proposed approach achieved good segmentation performance as well as good uncertainty estimation performance. In the experiments, we observed an improvement in pixel accuracy and mean intersection over union by removing uncertain pixels. This result reflects that the model provided less accurate predictions in uncertain areas that may need more attention and further inspection. The experiments also showed that with some performance compromises, the efficient model reduced computation time and model size, which expands the potential for implementation on portable devices used in resource-limited settings. Conclusions: Our study demonstrates the UNet-based BDL model not only can perform potentially malignant and malignant oral lesion segmentation, but also can provide informative pixel-level uncertainty estimation. With this extra uncertainty information, the accuracy and reliability of the model's prediction can be improved.
KW - Bayesian deep learning
KW - Monte Carlo dropout
KW - oral cancer
KW - semantic segmentation
KW - uncertainty measures of deep learning
UR - http://www.scopus.com/inward/record.url?scp=85141320743&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85141320743&partnerID=8YFLogxK
U2 - 10.1117/1.JBO.27.11.115001
DO - 10.1117/1.JBO.27.11.115001
M3 - Article
C2 - 36329004
AN - SCOPUS:85141320743
SN - 1083-3668
VL - 27
JO - Journal of biomedical optics
JF - Journal of biomedical optics
IS - 11
M1 - 115001
ER -