Compression of Semistructured Documents (Komprese semistrukturovaných dokumentů)

Result identifiers

Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F07%3A00005175" target="_blank" >RIV/00216208:11320/07:00005175 - isvavai.cz</a>
Result on the web
—
DOI - Digital Object Identifier
—

Alternative languages

Result language
English
Title in original language
Compression of Semistructured Documents
Result description in original language
EGOTHOR is a search engine that indexes the Web and lets users search Web documents. Its hit list contains the URL and title of each hit, along with a snippet that briefly shows the match. The snippet can almost always be assembled by an algorithm that has full knowledge of the original document (usually an HTML page). This implies that the search engine must store the full text of the documents as part of the index. That requirement calls for an appropriate compression algorithm to reduce the space demand. One solution would be to use common compression methods, for instance gzip or bzip2, but it may be preferable to develop a new method that takes advantage of the document structure, or rather, the textual character of the documents. Specialized text compression algorithms and methods for compressing XML documents already exist. The aim of this paper is an integration of the two approaches to achieve an optimal level of the com

Classification

Type
J<sub>x</sub> - Unclassified - Article in a scholarly periodical (Jimp, Jsc and Jost)
CEP field
JC - Computer hardware and software
OECD FORD field
—

Result continuities

Project
The result was created in the course of several projects. More information is available in the Projects tab.
Continuities
Z - Research plan (with a link to CEZ)

Other

Year of implementation
2007
Data confidentiality code
S - Complete and true data on the project are not subject to protection under special legal regulations

Data specific to the result type

Periodical name
International Journal of Information Technology
ISSN
1305-2403
e-ISSN
—
Periodical volume
4
Issue number within the volume
1
Publisher country
GB - United Kingdom of Great Britain and Northern Ireland
Number of pages
7
Pages from-to
11-17
UT WoS code of the article
—
EID of the result in the Scopus database
—

Similar results (10)

Compression of a Set of Files with Natural Language Content
Access Rights in Enterprise Full-text Search
Mathematical Extension of Full Text Search Engine Indexer
