Publication details

Improving feature selection process resistance to failures caused by curse-of-dimensionality effects

Journal Article

Somol Petr, Grim Jiří, Novovičová Jana, Pudil P.

serial: Kybernetika vol.47, 3 (2011), p. 401-425

research: CEZ:AV0Z10750506

project(s): 1M0572, GA MŠk, 2C06019, GA MŠk, GA102/08/0593, GA ČR

keywords: feature selection, curse of dimensionality, over-fitting, stability, machine learning, dimensionality reduction

preview: Download

abstract (eng):

The purpose of feature selection in machine learning is at least two-fold – saving measurement acquisition costs and reducing the negative effects of the curse of dimensionality with the aim to improve the accuracy of the models and the classification rate of classifiers with respect to previously unknown data. Yet it has been shown recently that the process of feature selection itself can be negatively affected by the very same curse of dimensionality – feature selection methods may easily over-fit or perform unstably. Such an outcome is unlikely to generalize well and the resulting recognition system may fail to deliver the expectable performance. In many tasks, it is therefore crucial to employ additional mechanisms of making the feature selection process more stable and resistant the curse of dimensionality effects. In this paper we discuss three different approaches to reducing this problem.