Measuring Web Page Similarity Based on Textual and Visual Properties
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216305%3A26230%2F12%3APU96212" target="_blank" >RIV/00216305:26230/12:PU96212 - isvavai.cz</a>
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
English
Original language name
Measuring Web Page Similarity Based on Textual and Visual Properties
Original language description
Measuring web page similarity is an important task in web mining and information retrieval. This paper introduces a method for measuring web page similarity that considers both the textual and the visual properties of pages. The textual properties of a page are described by a modified weight vector space model. General visual properties are captured by segmenting the page into visual blocks, whose properties are stored in a vector of visual properties. Both vectors are then used to compute the overall web page similarity. The paper describes the method in detail and presents the results of several experiments.
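The description above combines a textual vector and a visual vector into one similarity score. The record does not give the paper's term weighting, visual features, or combination formula, so the following is only a minimal sketch under stated assumptions: raw term counts stand in for the modified weight vector space model, the visual vector is a hypothetical dictionary of block properties, cosine similarity is used for both vectors, and `alpha` is an assumed mixing weight.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity of two sparse vectors stored as dicts."""
    dot = sum(value * b.get(key, 0.0) for key, value in a.items())
    norm_a = math.sqrt(sum(value * value for value in a.values()))
    norm_b = math.sqrt(sum(value * value for value in b.values()))
    # An empty vector has no direction, so treat its similarity as 0.
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def text_vector(text):
    """Assumption: raw term counts, not the paper's modified weighting."""
    return dict(Counter(text.lower().split()))


def page_similarity(text_a, visual_a, text_b, visual_b, alpha=0.5):
    """Weighted mix of textual and visual cosine similarity.

    visual_a and visual_b are hypothetical feature dicts, for example
    {"block_count": 5.0, "avg_font_size": 12.0}. alpha is an assumed
    parameter, not a value taken from the paper.
    """
    textual = cosine(text_vector(text_a), text_vector(text_b))
    visual = cosine(visual_a, visual_b)
    return alpha * textual + (1.0 - alpha) * visual
```

With `alpha=0.5` both properties contribute equally. Moving `alpha` toward 1 makes the score depend mostly on the text, and moving it toward 0 makes it depend mostly on the layout.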
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
JC - Computer hardware and software
OECD FORD branch
—
Result continuities
Project
—
Continuities
Z - Research plan (with a link to CEZ)
Others
Publication year
2012
Confidentiality
S - Complete and truthful data on the project are not subject to protection under special legal regulations
Data specific for result type
Article name in the collection
The 11th International Conference on Artificial Intelligence and Soft Computing
ISBN
978-3-642-29349-8
ISSN
—
e-ISSN
—
Number of pages
9
Pages from-to
13-21
Publisher name
Springer Verlag
Place of publication
Zakopane
Event location
Zakopane
Event date
Apr 29, 2012
Type of event by nationality
WRD - Worldwide event
UT code for WoS article
000314151300002