On Implementation of Word-Based Compression Methods
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F68407700%3A21230%2F08%3A03147357" target="_blank" >RIV/68407700:21230/08:03147357 - isvavai.cz</a>
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
On Implementation of Word-Based Compression Methods
Original language description
The paper presents an implementation of dictionary and statistical word-based data compression methods. The data compression is one of the main techniques of reducing time needed to transmit data over the network. The word-based text compression is a novel compression approach which exploits high correlation between words in sentence. The basic idea of the word-based compression methods is to consider words as source units instead of characters. These methods are efficient especially for natural language compression. Our results prove better compression ratio of word-based methods in comparison to character-based methods. We present generalized concept of dense coding in this paper. This concept allows us to adjust the coding schema to data domain andso achieve better compression ratio.
Czech name
Implementace slovních kompresních metod
Czech description
V tomto článku jsme popsali naše implementace slovních kontextových metod: slovní statistické a slovní slovníkové metody. V rámci této práce jsme také vytvořili nový kódovací systém Open Dense Coding založený na zobecnění End-Tagged Dense Coding. Tento systém výrazně zrychluje kódování ve slovních kompresních metodách.
Classification
Type
D - Article in proceedings
CEP classification
IN - Informatics
OECD FORD branch
—
Result continuities
Project
<a href="/en/project/GA201%2F06%2F1039" target="_blank" >GA201/06/1039: Text processing and analysis</a><br>
Continuities
Z - Vyzkumny zamer (s odkazem do CEZ)
Others
Publication year
2008
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
4th Doctoral Workshop on Mathematical and Engineering Methods in Computer Science
ISBN
978-80-7355-082-0
ISSN
—
e-ISSN
—
Number of pages
8
Pages from-to
—
Publisher name
Ing. Zdenek Novotny, CSc.
Place of publication
Brno
Event location
Znojmo
Event date
Nov 14, 2008
Type of event by nationality
CST - Celostátní akce
UT code for WoS article
—