Optimal column subset selection for image classification by genetic algorithms
Identifikátory výsledku
Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F61989100%3A27240%2F16%3A86099100" target="_blank" >RIV/61989100:27240/16:86099100 - isvavai.cz</a>
Výsledek na webu
<a href="http://dx.doi.org/10.1007/s10479-016-2331-0" target="_blank" >http://dx.doi.org/10.1007/s10479-016-2331-0</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/s10479-016-2331-0" target="_blank" >10.1007/s10479-016-2331-0</a>
Alternativní jazyky
Jazyk výsledku
angličtina
Název v původním jazyce
Optimal column subset selection for image classification by genetic algorithms
Popis výsledku v původním jazyce
Many problems in operations research can be solved by combinatorial optimization. Fixed-length subset selection is a family of combinatorial optimization problems that involve selection of a set of unique objects from a larger superset. Feature selection, p-median problem, and column subset selection problem are three examples of hard problems that involve search for fixed-length subsets. Due to their high complexity, exact algorithms are often infeasible to solve real-world instances of these problems and approximate methods based on various heuristic and metaheuristic (e.g. nature-inspired) approaches are often employed. Selecting column subsets from massive data matrices is an important technique useful for construction of compressed representations and low rank approximations of high-dimensional data. Search for an optimal subset of exactly k columns of a matrix, (Formula presented.), (Formula presented.), is a well-known hard optimization problem with practical implications for data processing and mining. It can be used for unsupervised feature selection, dimensionality reduction, data visualization, and so on. A compressed representation of raw real-world data can contribute, for example, to reduction of algorithm training times in supervised learning, to elimination of overfitting in classification and regression, to facilitation of better data understanding, and to many other benefits. This paper proposes a novel genetic algorithm for the column subset selection problem and evaluates it in a series of computational experiments with image classification. The evaluation shows that the proposed modifications improve the results obtained by artificial evolution. (C) 2016 Springer Science+Business Media New York
Název v anglickém jazyce
Optimal column subset selection for image classification by genetic algorithms
Popis výsledku anglicky
Many problems in operations research can be solved by combinatorial optimization. Fixed-length subset selection is a family of combinatorial optimization problems that involve selection of a set of unique objects from a larger superset. Feature selection, p-median problem, and column subset selection problem are three examples of hard problems that involve search for fixed-length subsets. Due to their high complexity, exact algorithms are often infeasible to solve real-world instances of these problems and approximate methods based on various heuristic and metaheuristic (e.g. nature-inspired) approaches are often employed. Selecting column subsets from massive data matrices is an important technique useful for construction of compressed representations and low rank approximations of high-dimensional data. Search for an optimal subset of exactly k columns of a matrix, (Formula presented.), (Formula presented.), is a well-known hard optimization problem with practical implications for data processing and mining. It can be used for unsupervised feature selection, dimensionality reduction, data visualization, and so on. A compressed representation of raw real-world data can contribute, for example, to reduction of algorithm training times in supervised learning, to elimination of overfitting in classification and regression, to facilitation of better data understanding, and to many other benefits. This paper proposes a novel genetic algorithm for the column subset selection problem and evaluates it in a series of computational experiments with image classification. The evaluation shows that the proposed modifications improve the results obtained by artificial evolution. (C) 2016 Springer Science+Business Media New York
Klasifikace
Druh
J<sub>x</sub> - Nezařazeno - Článek v odborném periodiku (Jimp, Jsc a Jost)
CEP obor
IN - Informatika
OECD FORD obor
—
Návaznosti výsledku
Projekt
<a href="/cs/project/GJ16-25694Y" target="_blank" >GJ16-25694Y: Mnohoparadigmatické algoritmy dolování z dat založené na vyhledávání, fuzzy technologiích a bio-inspirovaných výpočtech</a><br>
Návaznosti
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Ostatní
Rok uplatnění
2016
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Údaje specifické pro druh výsledku
Název periodika
Annals of Operations Research
ISSN
0254-5330
e-ISSN
—
Svazek periodika
2016
Číslo periodika v rámci svazku
2016
Stát vydavatele periodika
US - Spojené státy americké
Počet stran výsledku
18
Strana od-do
1-18
Kód UT WoS článku
—
EID výsledku v databázi Scopus
2-s2.0-84991018113