TY - GEN
T1 - An empirical study of the effects of principal component analysis on symbolic classifiers
AU - Zhao, Huimin
AU - Sinha, Atish P.
AU - Ram, Sudha
PY - 2008
Y1 - 2008
N2 - Classification is a frequently encountered data mining problem. While symbolic classifiers have high comprehensibility, their language bias may hamper their classification performance. Incorporating new features constructed based on the original features may relax such language bias and lead to performance improvement. Among others, principal component analysis (PCA) has been proposed as a possible method for enhancing the performance of decision trees. However, since PCA is an unsupervised method, the principal components may not represent the ideal projection directions for optimizing the classification performance. Thus, we expect PCA to have varying effects; it may improve classification performance if the projections enhance class differences, but may degrade performance otherwise. We also posit that the effects of PCA are similar on symbolic classifiers, including decision rules, decision trees, and decision tables. In this paper, we empirically evaluate the effects of PCA on symbolic classifiers and discuss the findings.
AB - Classification is a frequently encountered data mining problem. While symbolic classifiers have high comprehensibility, their language bias may hamper their classification performance. Incorporating new features constructed based on the original features may relax such language bias and lead to performance improvement. Among others, principal component analysis (PCA) has been proposed as a possible method for enhancing the performance of decision trees. However, since PCA is an unsupervised method, the principal components may not represent the ideal projection directions for optimizing the classification performance. Thus, we expect PCA to have varying effects; it may improve classification performance if the projections enhance class differences, but may degrade performance otherwise. We also posit that the effects of PCA are similar on symbolic classifiers, including decision rules, decision trees, and decision tables. In this paper, we empirically evaluate the effects of PCA on symbolic classifiers and discuss the findings.
KW - Classification
KW - Data mining
KW - Decision rule
KW - Decision table
KW - Decision tree
KW - Principal component analysis
KW - Symbolic classifier
UR - http://www.scopus.com/inward/record.url?scp=84870366437&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84870366437&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84870366437
SN - 9781605609539
T3 - 14th Americas Conference on Information Systems, AMCIS 2008
SP - 563
EP - 569
BT - 14th Americas Conference on Information Systems, AMCIS 2008
T2 - 14th Americas Conference on Information Systems, AMCIS 2008
Y2 - 14 August 2008 through 17 August 2008
ER -