The World of Tokens, Tags and Trees
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F18%3A10390102" target="_blank" >RIV/00216208:11320/18:10390102 - isvavai.cz</a>
Result on the web
<a href="http://ufal.mff.cuni.cz/books" target="_blank" >http://ufal.mff.cuni.cz/books</a>
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
The World of Tokens, Tags and Trees
Original language description
This monograph presents a comparative study of annotation approaches to morphology and syntax of natural languages, with emphasis on applicability in a multilingual environment. Annotation is understood as adding linguistic categories and relations to digitally encoded natural language text, resulting in annotated corpus; as syntactic relations are often represented in the form of dependency trees, the annotated corpora covered by the monograph are dependency treebanks. Many treebanks exist and their annotation styles vary significantly, which hampers their usefulness for linguists and language engineers. We survey several harmonization efforts that tried to come up with cross-linguistically applicable annotation guidelines, including the most recent and broadest effort to date, Universal Dependencies. We examine language description on three levels: 1. tokenization and word segmentation, 2. morphology, and 3. surface dependency syntax. For each language phenomenon we provide a comparison of its analy
Czech name
—
Czech description
—
Classification
Type
B - Specialist book
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
<a href="/en/project/GA15-10472S" target="_blank" >GA15-10472S: Morphologically and Syntactically Annotated Corpora of Many Languages</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2018
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
ISBN
978-80-88132-09-7
Number of pages
168
Publisher name
ÚFAL MFF UK
Place of publication
Praha, Czechia
UT code for WoS book
—