High-Dimensional Text Clustering by Dimensionality Reduction and Improved Density Peak

Identifikátory výsledku

Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F61989100%3A27240%2F20%3A10245959" target="_blank" >RIV/61989100:27240/20:10245959 - isvavai.cz</a>
Výsledek na webu
<a href="https://www.hindawi.com/journals/wcmc/2020/8881112/" target="_blank" >https://www.hindawi.com/journals/wcmc/2020/8881112/</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1155/2020/8881112" target="_blank" >10.1155/2020/8881112</a>

Alternativní jazyky

Jazyk výsledku
angličtina
Název v původním jazyce
High-Dimensional Text Clustering by Dimensionality Reduction and Improved Density Peak
Popis výsledku v původním jazyce
This study focuses on high-dimensional text data clustering, given the inability of K-means to process high-dimensional data and the need to specify the number of clusters and randomly select the initial centers. We propose a Stacked-Random Projection dimensionality reduction framework and an enhanced K-means algorithm DPC-K-means based on the improved density peaks algorithm. The improved density peaks algorithm determines the number of clusters and the initial clustering centers of K-means. Our proposed algorithm is validated using seven text datasets. Experimental results show that this algorithm is suitable for clustering of text data by correcting the defects of K-means.
Název v anglickém jazyce
High-Dimensional Text Clustering by Dimensionality Reduction and Improved Density Peak
Popis výsledku anglicky
This study focuses on high-dimensional text data clustering, given the inability of K-means to process high-dimensional data and the need to specify the number of clusters and randomly select the initial centers. We propose a Stacked-Random Projection dimensionality reduction framework and an enhanced K-means algorithm DPC-K-means based on the improved density peaks algorithm. The improved density peaks algorithm determines the number of clusters and the initial clustering centers of K-means. Our proposed algorithm is validated using seven text datasets. Experimental results show that this algorithm is suitable for clustering of text data by correcting the defects of K-means.

Klasifikace

Druh
J<sub>imp</sub> - Článek v periodiku v databázi Web of Science
CEP obor
—
OECD FORD obor
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)

Návaznosti výsledku

Projekt
—
Návaznosti
S - Specificky vyzkum na vysokych skolach

Ostatní

Rok uplatnění
2020
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Údaje specifické pro druh výsledku

Název periodika
Wireless Communications & Mobile Computing
ISSN
1530-8669
e-ISSN
—
Svazek periodika
2020
Číslo periodika v rámci svazku
OCTOBER
Stát vydavatele periodika
GB - Spojené království Velké Británie a Severního Irska
Počet stran výsledku
16
Strana od-do
—
Kód UT WoS článku
000594623600001
EID výsledku v databázi Scopus
—

Podobné výsledky(10)

High-dimensional data clustering algorithm based on stacked-random projection Improving Performance and Accuracy of Local PCA Estimation of the inductive model od objects clustering stability based on the K-means algorithm for different levels of data noise

Co hledáte?

Rychlé hledání

Chytré vyhledávání

High-Dimensional Text Clustering by Dimensionality Reduction and Improved Density Peak

Identifikátory výsledku

Alternativní jazyky

Klasifikace

Návaznosti výsledku

Ostatní

Údaje specifické pro druh výsledku

Podobné výsledky(10)

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Popis výsledku

Identifikátory výsledku

Identifikátory výsledku

Alternativní jazyky

Alternativní jazyky

Klasifikace

Klasifikace

Návaznosti výsledku

Návaznosti výsledku

Ostatní

Ostatní

Údaje specifické pro druh výsledku

Údaje specifické pro druh výsledku

Podobné výsledky(10)