Web Page Classification based on Schema.org Collection
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F61989100%3A27240%2F12%3A86085113" target="_blank" >RIV/61989100:27240/12:86085113 - isvavai.cz</a>
Alternative codes found
RIV/61989100:27740/12:86085113
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
Web Page Classification based on Schema.org Collection
Original language description
The internet is a library of a huge amount of information and there is a need for categorize its content based on web page classification. Classification of web page content can improve the quality of web search and its accuracy. Unfortunately the high dimensionality of the web pages dataset has made the process of classification difficult. The use of an automatic method for web page classification can simplify the whole process and assist the search engine in getting more relevant results. Nowadays information on the web is generally structured and formatted in a not formal way. This absence of semantics leads to create formal methods to provide more semantics information into web page. Search engines including Bing, Google, Yahoo! and Yandex formed collection of schemas Schema.org to support web page semantics and improve their search results. This paper explores the use of formal source code structure for classifying a large collection of the web content. Is focused on use of schema
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
IN - Informatics
OECD FORD branch
—
Result continuities
Project
Result was created during the realization of more than one project. More information in the Projects tab.
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)<br>S - Specificky vyzkum na vysokych skolach
Others
Publication year
2012
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Proceedings of the 2012 4th International Conference on Computational Aspects of Social Networks, CASoN 2012 : 21 ? 23 November 2012, S?o Carlos, Brazil
ISBN
978-1-4673-4793-8
ISSN
—
e-ISSN
—
Number of pages
5
Pages from-to
356-360
Publisher name
IEEE
Place of publication
New York
Event location
Sao Carlos
Event date
Nov 21, 2012
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
000314803000060