Frequency of Low-Frequency Words in Text Corpora
The result's identifiers
Result code in IS VaVaI
RIV/00216224:14330/10:00067070 - isvavai.cz (https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216224%3A14330%2F10%3A00067070)
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
English
Original language name
Frequency of Low-Frequency Words in Text Corpora
Original language description
Low-frequency words, especially words occurring only once in a text corpus, are very popular in text analysis. Many lexicographers also draw attention to such words. This paper presents a detailed statistical analysis of low-frequency words. The results provide important information for many practical applications, including lexicography and language modeling.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
AI - Linguistics
OECD FORD branch
—
Result continuities
Project
The result was created during the realization of more than one project. More information is available in the Projects tab.
Continuities
P - Research and development project financed from public funds (with a link to CEP)
Others
Publication year
2010
Confidentiality
S - Complete and true data on the project are not subject to protection under special legal regulations
Data specific for result type
Article name in the collection
Proceedings of Recent Advances in Slavonic Natural Language Processing, RASLAN 2010
ISBN
9788073992460
ISSN
—
e-ISSN
—
Number of pages
5
Pages from-to
53-57
Publisher name
Tribun EU
Place of publication
Brno
Event location
Karlova Studánka
Event date
Dec 3, 2010
Type of event by nationality
EUR - European event
UT code for WoS article
—