A System for Predictive Writing
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216224%3A14330%2F14%3A00077507" target="_blank" >RIV/00216224:14330/14:00077507 - isvavai.cz</a>
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
A System for Predictive Writing
Original language description
Most predictive writing systems are based on n-gram model with different size. Systems designed for English are easier than those for flective languages since even smaller models allow reasonable coverage. However, the same corpus size is significantly insufficient for languages with many word forms. The paper presents a new predictive writing system based on n-grams calculated from a large corpus. We designed the high-performance server-side script that returns either the most probable endings of a word or the most probable following words. We also designed the client-side script that is suitable for desktop computers without touchscreens. We calculated 150 millions most frequent n-grams for n = 1, . . . , 12 from a Czech corpus and evaluated the writing system on Czech texts. The system was then extended by custom-built model that can consist of domain or user specific n-grams.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
AI - Linguistics
OECD FORD branch
—
Result continuities
Project
—
Continuities
S - Specificky vyzkum na vysokych skolach
Others
Publication year
2014
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Eighth Workshop on Recent Advances in Slavonic Natural Language Processing
ISBN
—
ISSN
2336-4289
e-ISSN
—
Number of pages
8
Pages from-to
11-18
Publisher name
Tribun EU
Place of publication
Brno
Event location
Brno
Event date
Jan 1, 2014
Type of event by nationality
CST - Celostátní akce
UT code for WoS article
—