Genetic Algorithms in Syllable-Based Text Compression
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F07%3A00005155" target="_blank" >RIV/00216208:11320/07:00005155 - isvavai.cz</a>
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
Genetic Algorithms in Syllable-Based Text Compression
Original language description
Syllable based text compression is a new approach to compression by symbols. In this concept syllables are used as the compression symbols instead of the more common characters or words. This new technique has proven itself worthy especially on short tomiddle-length text files. The effectiveness of the compression is greatly affected by the quality of dictionaries of syllables characteristic for the certain language. These dictionaries are usually created with a straight-forward analysis of text corpora. In this paper we would like to introduce an other way of obtaining these dictionaries ? using genetic algorithm. We believe, that dictionaries built this way, may help us lower the compress ratio. We will measure this effect on a set of Czech and English texts.
Czech name
Genetické algoritmy ve slabikové kompresi
Czech description
Při kompresi malých textových souborů pomocí slabikové nebo slovní komprese se často používá slovník častých elementů. Tento článek se zabývá možností využití genetických algoritmů pro hledání vhodných slabik a slov, které by v tomto slovníku měli být obsaženy.
Classification
Type
J<sub>x</sub> - Unclassified - Peer-reviewed scientific article (Jimp, Jsc and Jost)
CEP classification
JC - Computer hardware and software
OECD FORD branch
—
Result continuities
Project
<a href="/en/project/1ET100300419" target="_blank" >1ET100300419: Intelligent Models, Algorithms, Methods and Tools for the Semantic Web (realization)</a><br>
Continuities
Z - Vyzkumny zamer (s odkazem do CEZ)
Others
Publication year
2007
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
CEUR Workshop Proceedings
ISSN
1613-0073
e-ISSN
—
Volume of the periodical
235
Issue of the periodical within the volume
Neuveden
Country of publishing house
GB - UNITED KINGDOM
Number of pages
14
Pages from-to
21-34
UT code for WoS article
—
EID of the result in the Scopus database
—