An experimental comparison of feature selection methods on two-class biomedical datasets
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216305%3A26220%2F15%3APU116428" target="_blank" >RIV/00216305:26220/15:PU116428 - isvavai.cz</a>
Result on the web
<a href="http://dx.doi.org/10.1016/j.compbiomed.2015.08.010" target="_blank" >http://dx.doi.org/10.1016/j.compbiomed.2015.08.010</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1016/j.compbiomed.2015.08.010" target="_blank" >10.1016/j.compbiomed.2015.08.010</a>
Alternative languages
Result language
angličtina
Original language name
An experimental comparison of feature selection methods on two-class biomedical datasets
Original language description
Feature selection is a significant part of many machine learning applications dealing with small-sample and high-dimensional data. Choosing the most important features is an essential step for knowledge discovery in many areas of biomedical informatics. The increased popularity of feature selection methods and their frequent utilisation raise challenging new questions about the interpretability and stability of feature selection techniques. In this study, we compared the behaviour of ten state-of-the-art filter methods for feature selection in terms of their stability, similarity, and influence on prediction performance. All of the experiments were conducted on eight two-class datasets from biomedical areas. While entropy-based feature selection appears to be the most stable, the feature selection techniques yielding the highest prediction performance are minimum redundance maximum relevance method and feature selection based on Bhattacharyya distance. In general, univariate feature selection techniques perform similarly to or even better than more complex multivariate feature selection techniques with high-dimensional datasets. However, with more complex and smaller datasets multivariate methods slightly outperform univariate techniques.
Czech name
—
Czech description
—
Classification
Type
J<sub>imp</sub> - Article in a specialist periodical, which is included in the Web of Science database
CEP classification
—
OECD FORD branch
20201 - Electrical and electronic engineering
Result continuities
Project
Result was created during the realization of more than one project. More information in the Projects tab.
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)<br>S - Specificky vyzkum na vysokych skolach
Others
Publication year
2015
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
COMPUTERS IN BIOLOGY AND MEDICINE
ISSN
0010-4825
e-ISSN
1879-0534
Volume of the periodical
66
Issue of the periodical within the volume
1
Country of publishing house
US - UNITED STATES
Number of pages
10
Pages from-to
1-10
UT code for WoS article
000365367400001
EID of the result in the Scopus database
—