Text Corpus with Errors
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216224%3A14330%2F03%3A00009149" target="_blank" >RIV/00216224:14330/03:00009149 - isvavai.cz</a>
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
Text Corpus with Errors
Original language description
This paper presents a description of a Czech text corpus (Chyby) containing various kinds of errors such as spelling, typographical, grammatical, style, lexical. We explain how Chyby has been built, how the errors in it have been discovered, marked and annotated. The classification of the errors is presented and the statistics concerning the types of errors is given. The tools for annotating the errors are also described. To the best of our knowledge, this is first text corpus of this sort prepared forCzech.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
IN - Informatics
OECD FORD branch
—
Result continuities
Project
—
Continuities
Z - Vyzkumny zamer (s odkazem do CEZ)
Others
Publication year
2003
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Text, Speech and Dialogue: Sixth International Conference, TSD 2003
ISBN
3-540-200-24-X
ISSN
—
e-ISSN
—
Number of pages
8
Pages from-to
90-97
Publisher name
Springer Verlag
Place of publication
Berlin
Event location
České Budějovice, Czech republic
Event date
Sep 9, 2003
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—