Measuring Web Page Similarity Based on Textual and Visual Properties

The result's identifiers

  • Result code in IS VaVaI

    RIV/00216305:26230/12:PU96212 - isvavai.cz (https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216305%3A26230%2F12%3APU96212)

  • Result on the web

  • DOI - Digital Object Identifier

Alternative languages

  • Result language

    English

  • Original language name

    Measuring Web Page Similarity Based on Textual and Visual Properties

  • Original language description

    Measuring web page similarity is an important task in web mining and information retrieval. This paper introduces a method for measuring web page similarity that considers both textual and visual properties of pages. Textual properties of a page are described by means of a modified weight vector space model. General visual properties are captured by segmenting a page into visual blocks, whose properties are stored in a vector of visual properties. Both vectors are then used to compute the overall web page similarity. The paper describes the method in detail and presents the results of several experiments.

  • Czech name

  • Czech description
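
The original language description above outlines a two-vector approach: one vector for text, one for visual block properties, combined into a single score. The following is only a minimal sketch of that general idea, not the paper's method. It assumes plain term frequencies in place of the paper's modified weighting, cosine similarity for both vectors, and a linear mix controlled by a hypothetical parameter `alpha`; the visual feature names (`blocks`, `images`) are likewise illustrative assumptions, since the record does not list the actual features.

```python
import math
from collections import Counter


def term_vector(text):
    """Plain term-frequency vector (assumption: stands in for the
    paper's modified weighting, which this record does not specify)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity of two sparse vectors stored as dicts."""
    dot = sum(a[k] * b.get(k, 0.0) for k in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an empty vector is treated as dissimilar to everything
    return dot / (norm_a * norm_b)


def page_similarity(text_a, visual_a, text_b, visual_b, alpha=0.5):
    """Linear mix of textual and visual similarity.

    alpha is a hypothetical weight: 1.0 uses text only, 0.0 visuals only.
    """
    text_sim = cosine(term_vector(text_a), term_vector(text_b))
    visual_sim = cosine(visual_a, visual_b)
    return alpha * text_sim + (1.0 - alpha) * visual_sim


# Illustrative inputs; the visual feature names are made up for this sketch.
page_a = ("web mining and information retrieval", {"blocks": 4.0, "images": 2.0})
page_b = ("information retrieval on the web", {"blocks": 5.0, "images": 1.0})
score = page_similarity(page_a[0], page_a[1], page_b[0], page_b[1])
```

A linear mix is chosen here only because it is the simplest way to combine two scores in the range 0 to 1; the paper may combine the vectors differently.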

Classification

  • Type

    D - Article in proceedings

  • CEP classification

    JC - Computer hardware and software

  • OECD FORD branch

Result continuities

  • Project

  • Continuities

    Z - Research plan (with a link to CEZ)

Others

  • Publication year

    2012

  • Confidentiality

    S - Complete and truthful data on the project are not subject to protection under special legal regulations

Data specific for result type

  • Article name in the collection

    The 11th International Conference on Artificial Intelligence and Soft Computing

  • ISBN

    978-3-642-29349-8

  • ISSN

  • e-ISSN

  • Number of pages

    9

  • Pages from-to

    13-21

  • Publisher name

    Springer Verlag

  • Place of publication

    Zakopane

  • Event location

    Zakopane

  • Event date

    Apr 29, 2012

  • Type of event by nationality

    WRD - Worldwide event

  • UT code for WoS article

    000314151300002