Text-Based Web Page Classification with Use of Visual Information
The result's identifiers
Result code in IS VaVaI
RIV/00216305:26230/10:PU89576 - isvavai.cz (https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216305%3A26230%2F10%3APU89576)
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
English
Original language name
Text-Based Web Page Classification with Use of Visual Information
Original language description
As the number of pages on the web is continually increasing, there is a need to classify pages into categories to facilitate indexing and searching. The method proposed here uses both textual and visual information to find a suitable representation of web page content. Several term weights based on TF or TF-IDF weighting are proposed; each is modified according to the visual areas in which the text appears and the visual properties of those areas. Some experimental results are included in the final part of the paper.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
IN - Informatics
OECD FORD branch
—
Result continuities
Project
—
Continuities
Z - Research plan (with a reference to CEZ)
S - Specific research at universities
Others
Publication year
2010
Confidentiality
S - Complete and truthful data on the project are not subject to protection under special legal regulations
Data specific for result type
Article name in the collection
2010 International Conference on Advances in Social Network Analysis and Mining
ISBN
978-0-7695-4138-9
ISSN
—
e-ISSN
—
Number of pages
5
Pages from-to
—
Publisher name
IEEE Computer Society
Place of publication
Odense
Event location
Odense
Event date
Aug 9, 2010
Type of event by nationality
WRD - Worldwide event
UT code for WoS article
—