Informational Cathegorical Data Clustering

The result's identifiers

Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F67985556%3A_____%2F07%3A00098540" target="_blank" >RIV/67985556:_____/07:00098540 - isvavai.cz</a>
Alternative codes found
RIV/68407700:21340/07:04137636
Result on the web
—
DOI - Digital Object Identifier
—

Alternative languages

Result language
angličtina
Original language name
Informational Cathegorical Data Clustering
Original language description
The EM algorithm has been used repeatedly to identify latent classes in categorical data by estimating finite distribution mixtures of product components. Unfortunately, the underlying mixtures are not uniquely identifiable and, moreover, the estimated mixture parameters are starting-point dependent. For this reason we use the latent class model only to define a set of ``elementary'' classes by estimating a mixture of a large number components. As such a mixture we use also an optimally smoothed kernelestimate. We propose a hierarchical ``bottom up'' cluster analysis based on unifying the elementary latent classes sequentially. The clustering procedure is controlled by minimum information loss criterion.
Czech name
Informační shlukování kategoriálních dat
Czech description
Shlukování kategoriálních dat je často řešeno hledáním tzv. latentních tříd pomocí EM algoritmu. Tento přístup ovšem závisí na počátečním řešení a naráží na problém neidentifikovatelosti směsi. Popisovaná metoda vyhledává shluky nikoliv jako jednotlivé komponenty směsi jako v případě latentních tříd, ale jako podsměsi vzniklé sloučením několika jednoduchých tříd z odhadnuté distribuční směsi s vyšším počtem komponent. Extrémní variantou takové směsi může být jádrový odhad, jehož optimální vyhlazení je vpráci popsáno. V práci je dále představena metoda hierarchického shlukování s kritériem nejmenší informační ztráty.

Classification

Type
D - Article in proceedings
CEP classification
BB - Applied statistics, operational research
OECD FORD branch
—

Result continuities

Project
Result was created during the realization of more than one project. More information in the Projects tab.
Continuities
Z - Vyzkumny zamer (s odkazem do CEZ)

Others

Publication year
2007
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Data specific for result type

Article name in the collection
Doktorandské dny 2007
ISBN
978-80-01-03913-7
ISSN
—
e-ISSN
—
Number of pages
10
Pages from-to
57-66
Publisher name
Česká technika ČVUT
Place of publication
Praha
Event location
Praha
Event date
Nov 16, 2007
Type of event by nationality
CST - Celostátní akce
UT code for WoS article
—

Similar results(10)

Minimum Information Loss Cluster Analysis for Cathegorical Data Minimum Information Loss Cluster Analysis for Categorical Data Gaussian Latent Representations for Uncertainty Estimation using Mahalanobis Distance in Deep Classifiers

What are you looking for?

Quick search

Smart search

Informational Cathegorical Data Clustering

The result's identifiers

Alternative languages

Classification

Result continuities

Others

Data specific for result type

Similar results(10)

What are you looking for?

Quick search

Smart search

Result description

The result's identifiers

The result's identifiers

Alternative languages

Alternative languages

Classification

Classification

Result continuities

Result continuities

Others

Others

Data specific for result type

Data specific for result type

Similar results(10)