Data Clustering: From Documents to the Web

The result's identifiers

Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F67985807%3A_____%2F07%3A00048323" target="_blank" >RIV/67985807:_____/07:00048323 - isvavai.cz</a>
Result on the web
—
DOI - Digital Object Identifier
—

Alternative languages

Result language
angličtina
Original language name
Data Clustering: From Documents to the Web
Original language description
The chapter provides a survey of some clustering methods relevant to the clustering document collections and, in consequence, Web data. We start with classical methods of cluster analysis which seem to be relevant in approaching to cluster Web data. Thegraph clustering is also described since its methods contribute significantly to clustering Web data. A use of artificial neural networks for clustering has the same motivation. Based on previously presented material, the core of the chapter provides anoverview of approaches to clustering in the Web environment. Particularly, we focus on clustering web search results, in which clustering search engines arrange the search results into groups around a common theme. We conclude with some general considerations concerning the justification of so many clustering algorithms and their application in the Web environment.
Czech name
Shlukování dat: Od dokumentů k Webu
Czech description
Kapitola poskytuje přehled některých shlukovacích metod, včetně jejich principů, vhodných pro shlukování v kolekcích dokumentů a v konečném důsledku i v prostředí internetu, zejména pak v prostředí služby World Wide Web. Posun směrem k webovým aplikacímvedl k zařazení postupů shlukování na grafech, neboť tyto metody jsou velmi užitečné shlukování v prostředí služby WWW. Motivace zařazení metod shlukování založených na neuronových sítích je také motivována rozsáhlostí dat na službě WWW, nezvládnutelnoupomocí klasických algoritmů. Jádrem kapitoly je pak aplikace shlukovaní na výsledky vyhledávání. V závěru jsou uvedeny některé obecné úvahy týkající ospravedlnění existence tak velkého množství shlukovačích algoritmů a jejich aplikace v prostředí službyWWW

Classification

Type
C - Chapter in a specialist book
CEP classification
BB - Applied statistics, operational research
OECD FORD branch
—

Result continuities

Project
Result was created during the realization of more than one project. More information in the Projects tab.
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)<br>Z - Vyzkumny zamer (s odkazem do CEZ)

Others

Publication year
2007
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Data specific for result type

Book/collection name
Web data Management Practices. Emerging Techniques and Technologies
ISBN
1-59904229-0
Number of pages of the result
33
Pages from-to
1-22
Number of pages of the book
—
Publisher name
Idea Group Publishing
Place of publication
Hershey
UT code for WoS chapter
—

Similar results(10)

A New Search Result Clustering Using Haar Wavelet Transform Semantic Analysis of Web Pages using Cluster Analysis and Nonnegative Matrix Factorization Cluster labeling with linked data

What are you looking for?

Quick search

Smart search

Data Clustering: From Documents to the Web

The result's identifiers

Alternative languages

Classification

Result continuities

Others

Data specific for result type

Similar results(10)

What are you looking for?

Quick search

Smart search

Result description

The result's identifiers

The result's identifiers

Alternative languages

Alternative languages

Classification

Classification

Result continuities

Result continuities

Others

Others

Data specific for result type

Data specific for result type

Similar results(10)