TY - JOUR
T1 - Lung cancer lesion detection in histopathology images using graph-based sparse PCA network
AU - Ram, Sundaresh
AU - Tang, Wenfei
AU - Bell, Alexander J.
AU - Pal, Ravi
AU - Spencer, Cara
AU - Buschhaus, Alexander
AU - Hatt, Charles R.
AU - diMagliano, Marina Pasca
AU - Rehemtulla, Alnawaz
AU - Rodríguez, Jeffrey J.
AU - Galban, Stefanie
AU - Galban, Craig J.
N1 - Publisher Copyright:
© 2023
PY - 2023/8
Y1 - 2023/8
N2 - Early detection of lung cancer is critical for improvement of patient survival. To address the clinical need for efficacious treatments, genetically engineered mouse models (GEMM) have become integral in identifying and evaluating the molecular underpinnings of this complex disease that may be exploited as therapeutic targets. Assessment of GEMM tumor burden on histopathological sections performed by manual inspection is both time consuming and prone to subjective bias. Therefore, an interplay of needs and challenges exists for computer-aided diagnostic tools, for accurate and efficient analysis of these histopathology images. In this paper, we propose a simple machine learning approach called the graph-based sparse principal component analysis (GS-PCA) network, for automated detection of cancerous lesions on histological lung slides stained by hematoxylin and eosin (H&E). Our method comprises four steps: 1) cascaded graph-based sparse PCA, 2) PCA binary hashing, 3) block-wise histograms, and 4) support vector machine (SVM) classification. In our proposed architecture, graph-based sparse PCA is employed to learn the filter banks of the multiple stages of a convolutional network. This is followed by PCA hashing and block histograms for indexing and pooling. The meaningful features extracted from this GS-PCA are then fed to an SVM classifier. We evaluate the performance of the proposed algorithm on H&E slides obtained from an inducible K-rasG12D lung cancer mouse model using precision/recall rates, Fβ-score, Tanimoto coefficient, and area under the curve (AUC) of the receiver operator characteristic (ROC) and show that our algorithm is efficient and provides improved detection accuracy compared to existing algorithms.
AB - Early detection of lung cancer is critical for improvement of patient survival. To address the clinical need for efficacious treatments, genetically engineered mouse models (GEMM) have become integral in identifying and evaluating the molecular underpinnings of this complex disease that may be exploited as therapeutic targets. Assessment of GEMM tumor burden on histopathological sections performed by manual inspection is both time consuming and prone to subjective bias. Therefore, an interplay of needs and challenges exists for computer-aided diagnostic tools, for accurate and efficient analysis of these histopathology images. In this paper, we propose a simple machine learning approach called the graph-based sparse principal component analysis (GS-PCA) network, for automated detection of cancerous lesions on histological lung slides stained by hematoxylin and eosin (H&E). Our method comprises four steps: 1) cascaded graph-based sparse PCA, 2) PCA binary hashing, 3) block-wise histograms, and 4) support vector machine (SVM) classification. In our proposed architecture, graph-based sparse PCA is employed to learn the filter banks of the multiple stages of a convolutional network. This is followed by PCA hashing and block histograms for indexing and pooling. The meaningful features extracted from this GS-PCA are then fed to an SVM classifier. We evaluate the performance of the proposed algorithm on H&E slides obtained from an inducible K-rasG12D lung cancer mouse model using precision/recall rates, Fβ-score, Tanimoto coefficient, and area under the curve (AUC) of the receiver operator characteristic (ROC) and show that our algorithm is efficient and provides improved detection accuracy compared to existing algorithms.
KW - Cancer lesion detection
KW - Computational imaging
KW - Graph-based sparse PCA
KW - Image analysis
KW - Machine learning
UR - http://www.scopus.com/inward/record.url?scp=85161050238&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85161050238&partnerID=8YFLogxK
U2 - 10.1016/j.neo.2023.100911
DO - 10.1016/j.neo.2023.100911
M3 - Article
C2 - 37269818
AN - SCOPUS:85161050238
SN - 1522-8002
VL - 42
JO - Neoplasia (United States)
JF - Neoplasia (United States)
M1 - 100911
ER -