Incremental filter and wrapper approaches for feature discretization
Figueiredo, M. A. T.
Neurocomputing Vol. 123, Nº 1, pp. 60 - 74, January, 2014.
ISSN (print): 0925-2312
Scimago Journal Ranking: 0,88 (in 2014)
Digital Object Identifier: 10.1016/j.neucom.2012.10.036
Discrete data representations are necessary, or at least convenient, in many machine learning problems. While feature selection (FS) techniques aim at finding relevant subsets of features, the goal of feature discretization (FD) is to find concise (quantized) data representations, adequate for the learning task at hand. In this paper, we propose two incremental methods for FD. The first method belongs to the filter family, in which the quality of the discretization is assessed by a (supervised or unsupervised) relevance criterion. The second method is a wrapper, where discretized features are assessed using a classifier. Both methods can be coupled
with any static (unsupervised or supervised) discretization procedure and can be used to perform FS as pre-processing or postprocessing stages. The proposed methods attain efficient representations suitable for binary and multi-class problems with different types of data, being competitive with existing methods. Moreover, using well-known FS methods with the features discretized by our techniques leads to better accuracy than with the features discretized by other methods or with the original features.