TY - JOUR
T1 - Multiobjective genetic optimization of diagnostic classifiers with implications for generating receiver operating characteristic curves
AU - Kupinski, Matthew A.
N1 - Funding Information:
Manuscript received July 21, 1998; revised June 30, 1999. This work was supported in part by the US Army Medical Research and Materiel Command under grants DAMD 17-96-1-6058 and DAMD 17-97-1-7202 and in part by the USPHS under Grants CA24806 and RR11459. The Associate Editor responsible for coordinating the review of this paper and recommending its publication was A. Burgess. Asterisk indicates corresponding author. *M. A. Kupinski is with Kurt Rossmann Laboratories, Department of Radiology, The University of Chicago, Chicago, IL 60637 USA.
PY - 1999
Y1 - 1999
N2 - It is well understood that binary classifiers have two implicit objective functions (sensitivity and specificity) describing their performance. Traditional methods of classifier training attempt to combine these two objective functions (or two analogous class performance measures) into one so that conventional scalar optimization techniques can be utilized. This involves incorporating a priori information into the aggregation method so that the resulting performance of the classifier is satisfactory for the task at hand. We have investigated the use of a niched Pareto multiobjective genetic algorithm (GA) for classifier optimization. With niched Pareto GA's, an objective vector is optimized instead of a scalar function, eliminating the need to aggregate classification objective functions. The niched Pareto GA returns a set of optimal solutions that are equivalent in the absence of any information regarding the preferences of the objectives. The a priori knowledge that was used for aggregating the objective functions in conventional classifier training can instead be applied post-optimization to select from one of the series of solutions returned from the multiobjective genetic optimization. We have applied this technique to train a linear classifier and an artificial neural network (ANN), using simulated datasets. The performances of the solutions returned from the multiobjective genetic optimization represent a series of optimal (sensitivity, specificity) pairs, which can be thought of as operating points on a receiver operating characteristic (ROC) curve. All possible ROC curves for a given dataset and classifier are less than or equal to the ROC curve generated by the niched Pareto genetic optimization. Diagnostic classifiers, genetic algorithms, multiobjective optimization, ROC analysis.
AB - It is well understood that binary classifiers have two implicit objective functions (sensitivity and specificity) describing their performance. Traditional methods of classifier training attempt to combine these two objective functions (or two analogous class performance measures) into one so that conventional scalar optimization techniques can be utilized. This involves incorporating a priori information into the aggregation method so that the resulting performance of the classifier is satisfactory for the task at hand. We have investigated the use of a niched Pareto multiobjective genetic algorithm (GA) for classifier optimization. With niched Pareto GA's, an objective vector is optimized instead of a scalar function, eliminating the need to aggregate classification objective functions. The niched Pareto GA returns a set of optimal solutions that are equivalent in the absence of any information regarding the preferences of the objectives. The a priori knowledge that was used for aggregating the objective functions in conventional classifier training can instead be applied post-optimization to select from one of the series of solutions returned from the multiobjective genetic optimization. We have applied this technique to train a linear classifier and an artificial neural network (ANN), using simulated datasets. The performances of the solutions returned from the multiobjective genetic optimization represent a series of optimal (sensitivity, specificity) pairs, which can be thought of as operating points on a receiver operating characteristic (ROC) curve. All possible ROC curves for a given dataset and classifier are less than or equal to the ROC curve generated by the niched Pareto genetic optimization. Diagnostic classifiers, genetic algorithms, multiobjective optimization, ROC analysis.
UR - http://www.scopus.com/inward/record.url?scp=0033169217&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0033169217&partnerID=8YFLogxK
U2 - 10.1109/42.796281
DO - 10.1109/42.796281
M3 - Article
C2 - 10534050
AN - SCOPUS:0033169217
SN - 0278-0062
VL - 18
SP - 675
EP - 685
JO - IEEE Transactions on Medical Imaging
JF - IEEE Transactions on Medical Imaging
IS - 8
ER -