TY - JOUR
T1 - Ideal observer approximation using Bayesian classification neural networks
AU - Kupinski, Matthew A.
AU - Edwards, Darrin C.
AU - Giger, Maryellen L.
AU - Metz, Charles E.
N1 - Funding Information:
Manuscript received December 8, 1999; revised June 15, 2001. This work was supported in part by the US Army Medical Research and Materieal Command under Grant DAMD 17-96-1-6058 and Grant 17-97-1-7202, and in part by the United States Public Health Service (USPHS) under Grant RR11459, Grant T32 CA09649, and Grant GM 57622. The Associate Editor responsible for coordinating the review of this paper and recommending its publication was S. Pizer. Asterisk indicates corresponding author. *M. A. Kupinski is with the Department of Radiology, MC2026, The University of Chicago, Chicago, IL 60637 USA (e-mail: [email protected]).
PY - 2001/9
Y1 - 2001/9
N2 - It is well understood that the optimal classification decision variable is the likelihood ratio or any monotonic transformation of the likelihood ratio. An automated classifier which maps from an input space to one of the likelihood ratio family of decision variables is an optimal classifier or "ideal observer" Artificial neural networks (ANNs) are frequently used as classifiers for many problems. In the limit of large training sample sizes, an ANN approximates a mapping function which is a monotonic transformation of the likelihood ratio, i.e., it estimates an ideal observer decision variable. A principal disadvantage of conventional ANNs is the potential over-parameterization of the mapping function which results in a poor approximation of an optimal mapping function for smaller training samples. Recently, Bayesian methods have been applied to ANNs in order to regularize training to improve the robustness of the classifier. The goal of training a Bayesian ANN with finite sample sizes is, as with unlimited data, to approximate the ideal observer. We have evaluated the accuracy of Bayesian ANN models of ideal observer decision variables as a function of the number of hidden units used, the signal-to-noise ratio of the data and the number of features or dimensionality of the data. We show that when enough training data are present, excess hidden units do not substantially degrade the accuracy of Bayesian ANNs. However, the minimum number of hidden units required to best model the optimal mapping function varies with the complexity of the data.
AB - It is well understood that the optimal classification decision variable is the likelihood ratio or any monotonic transformation of the likelihood ratio. An automated classifier which maps from an input space to one of the likelihood ratio family of decision variables is an optimal classifier or "ideal observer" Artificial neural networks (ANNs) are frequently used as classifiers for many problems. In the limit of large training sample sizes, an ANN approximates a mapping function which is a monotonic transformation of the likelihood ratio, i.e., it estimates an ideal observer decision variable. A principal disadvantage of conventional ANNs is the potential over-parameterization of the mapping function which results in a poor approximation of an optimal mapping function for smaller training samples. Recently, Bayesian methods have been applied to ANNs in order to regularize training to improve the robustness of the classifier. The goal of training a Bayesian ANN with finite sample sizes is, as with unlimited data, to approximate the ideal observer. We have evaluated the accuracy of Bayesian ANN models of ideal observer decision variables as a function of the number of hidden units used, the signal-to-noise ratio of the data and the number of features or dimensionality of the data. We show that when enough training data are present, excess hidden units do not substantially degrade the accuracy of Bayesian ANNs. However, the minimum number of hidden units required to best model the optimal mapping function varies with the complexity of the data.
KW - Bayesian neural networks
KW - Computer-aided diagnosis
KW - Ideal observers
KW - ROC analysis
UR - http://www.scopus.com/inward/record.url?scp=0035437718&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0035437718&partnerID=8YFLogxK
U2 - 10.1109/42.952727
DO - 10.1109/42.952727
M3 - Article
C2 - 11585206
AN - SCOPUS:0035437718
SN - 0278-0062
VL - 20
SP - 886
EP - 899
JO - IEEE Transactions on Medical Imaging
JF - IEEE Transactions on Medical Imaging
IS - 9
ER -