Genetic Algorithms in Syllable-Based Text Compression

Identifikátory výsledku

Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F07%3A00206195" target="_blank" >RIV/00216208:11320/07:00206195 - isvavai.cz</a>
Výsledek na webu
—
DOI - Digital Object Identifier
—

Alternativní jazyky

Jazyk výsledku
angličtina
Název v původním jazyce
Genetic Algorithms in Syllable-Based Text Compression
Popis výsledku v původním jazyce
Syllable based text compression is a new approach to compression by symbols. In this concept syllables are used as the compression symbols instead of the more common characters or words. This new technique has proven itself worthy especially on short tomiddle-length text files. The effectiveness of the compression is greatly affected by the quality of dictionaries of syllables characteristic for the certain language. These dictionaries are usually created with a straight-forward analysis of text corpora. In this paper we would like to introduce an other way of obtaining these dictionaries ? using genetic algorithm. We believe, that dictionaries built this way, may help us lower the compress ratio. We will measure this effect on a set of Czech and English texts.
Název v anglickém jazyce
Genetic Algorithms in Syllable-Based Text Compression
Popis výsledku anglicky
Syllable based text compression is a new approach to compression by symbols. In this concept syllables are used as the compression symbols instead of the more common characters or words. This new technique has proven itself worthy especially on short tomiddle-length text files. The effectiveness of the compression is greatly affected by the quality of dictionaries of syllables characteristic for the certain language. These dictionaries are usually created with a straight-forward analysis of text corpora. In this paper we would like to introduce an other way of obtaining these dictionaries ? using genetic algorithm. We believe, that dictionaries built this way, may help us lower the compress ratio. We will measure this effect on a set of Czech and English texts.

Klasifikace

Druh
D - Stať ve sborníku
CEP obor
JC - Počítačový hardware a software
OECD FORD obor
—

Návaznosti výsledku

Projekt
<a href="/cs/project/1ET100300419" target="_blank" >1ET100300419: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu</a><br>
Návaznosti
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)<br>Z - Vyzkumny zamer (s odkazem do CEZ)

Ostatní

Rok uplatnění
2007
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Údaje specifické pro druh výsledku

Název statě ve sborníku
DATESO 2007
ISBN
978-80-7378-002-9
ISSN
—
e-ISSN
—
Počet stran výsledku
14
Strana od-do
—
Název nakladatele
Matfyz Press, Praha
Místo vydání
—
Místo konání akce
Neuveden
Datum konání akce
1. 1. 2007
Typ akce podle státní příslušnosti
EUR - Evropská akce
Kód UT WoS článku
000272455400003

Podobné výsledky(10)

Genetické algoritmy ve slabikové kompresi A syllable-based method for Vietnamese text compression Searching for optimal alphabet for data compression using simulated annealing

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Genetic Algorithms in Syllable-Based Text Compression

Identifikátory výsledku

Alternativní jazyky

Klasifikace

Návaznosti výsledku

Ostatní

Údaje specifické pro druh výsledku

Podobné výsledky(10)

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Popis výsledku

Identifikátory výsledku

Identifikátory výsledku

Alternativní jazyky

Alternativní jazyky

Klasifikace

Klasifikace

Návaznosti výsledku

Návaznosti výsledku

Ostatní

Ostatní

Údaje specifické pro druh výsledku

Údaje specifické pro druh výsledku

Podobné výsledky(10)