Morphological Tags in Parallel Corpora
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11210%2F10%3A10070358" target="_blank" >RIV/00216208:11210/10:10070358 - isvavai.cz</a>
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
Morphological Tags in Parallel Corpora
Original language description
Tagsets, used to annotate corpora, often classify word classes and morphological categories according to different criteria, even within a single language. Texts tagged in disparate ways make searching and automatic processing harder. For a parallel corpus a single "harmonized" tagset could be designed (similarly as in the project MULTEXT-East), or - even better - to encode the information from all tagsets into a morphosyntactic "interlingua" (see Dan Zeman's Interset). The parallel with natural languages is appropriate: problems with missing equivalents occur in the translation of words as well as tags. Thus we propose a tagset interlingua as a hierarchy (lattice) of categories, corrosponding to language-specific tags. A missing tag in a language canbe substituted by a more general tag or a by a disjunction of more specific tags. Similarly as with multilingual lexical databases the methods of Formal Concept Analysis can be used.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
AI - Linguistics
OECD FORD branch
—
Result continuities
Project
—
Continuities
Z - Vyzkumny zamer (s odkazem do CEZ)
Others
Publication year
2010
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
InterCorp: exploring a multilingual corpus
ISBN
978-80-7422-042-5
ISSN
—
e-ISSN
—
Number of pages
30
Pages from-to
—
Publisher name
Nakladatelství Lidové noviny
Place of publication
Praha
Event location
Praha
Event date
Sep 17, 2009
Type of event by nationality
EUR - Evropská akce
UT code for WoS article
—