Abstract
In this paper we propose a new approach for classification problems and apply it to eight problems. A classification problem with a large feature set is partitioned using clustering on a subset of the features. A multi-label classifier is then trained individually on each cluster, using automatic feature selection to customize the feature set for the cluster. The algorithm achieves one to two percent higher accuracy for most of the problems investigated in this study.
Original language | English (US) |
---|---|
Pages (from-to) | 1288-1299 |
Number of pages | 12 |
Journal | Statistics and Probability Letters |
Volume | 77 |
Issue number | 12 |
DOIs | |
State | Published - Jul 1 2007 |
Keywords
- Classification
- Clustering
- Feature selection
ASJC Scopus subject areas
- Statistics and Probability
- Statistics, Probability and Uncertainty