An empirical study of the effects of principal component analysis on symbolic classifiers

Huimin Zhao, Atish P. Sinha, Sudha Ram

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Classification is a frequently encountered data mining problem. While symbolic classifiers have high comprehensibility, their language bias may hamper their classification performance. Incorporating new features constructed based on the original features may relax such language bias and lead to performance improvement. Among others, principal component analysis (PCA) has been proposed as a possible method for enhancing the performance of decision trees. However, since PCA is an unsupervised method, the principal components may not represent the ideal projection directions for optimizing the classification performance. Thus, we expect PCA to have varying effects; it may improve classification performance if the projections enhance class differences, but may degrade performance otherwise. We also posit that the effects of PCA are similar on symbolic classifiers, including decision rules, decision trees, and decision tables. In this paper, we empirically evaluate the effects of PCA on symbolic classifiers and discuss the findings.

Original languageEnglish (US)
Title of host publication14th Americas Conference on Information Systems, AMCIS 2008
Pages563-569
Number of pages7
StatePublished - 2008
Event14th Americas Conference on Information Systems, AMCIS 2008 - Toronto, ON, Canada
Duration: Aug 14 2008Aug 17 2008

Publication series

Name14th Americas Conference on Information Systems, AMCIS 2008
Volume1

Other

Other14th Americas Conference on Information Systems, AMCIS 2008
Country/TerritoryCanada
CityToronto, ON
Period8/14/088/17/08

Keywords

  • Classification
  • Data mining
  • Decision rule
  • Decision table
  • Decision tree
  • Principal component analysis
  • Symbolic classifier

ASJC Scopus subject areas

  • Computer Science Applications
  • Computer Networks and Communications
  • Library and Information Sciences
  • Information Systems

Fingerprint

Dive into the research topics of 'An empirical study of the effects of principal component analysis on symbolic classifiers'. Together they form a unique fingerprint.

Cite this