The InterCorp parallel corpus with a uniform annotation for all languages
The result's identifiers
Result code in IS VaVaI
RIV/00216208:11210/23:10474273 - isvavai.cz (https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11210%2F23%3A10474273)
Result on the web
https://verso.is.cuni.cz/pub/verso.fpl?fname=obd_publikace_handle&handle=LQ7MouwpKo
DOI - Digital Object Identifier
10.2478/jazcas-2023-0043 (http://dx.doi.org/10.2478/jazcas-2023-0043)
Alternative languages
Result language
English
Original language name
The InterCorp parallel corpus with a uniform annotation for all languages
Original language description
Recently, the language-specific morphosyntactic annotation of InterCorp, a large multilingual parallel corpus, has been replaced by a language-uniform morphosyntactic and syntactic annotation following the guidelines of the Universal Dependencies project. Because the corpus is used predominantly by human users via a token-based concordancer, the CoNLL-U format produced by the UDPipe parser has been extended with attributes such as the lemma of the token's syntactic head or the morphosyntactic categories of the content verb's auxiliary. We conclude that, despite some theoretical and practical issues, the new annotation is a promising solution to the problem of mutually incompatible tagsets within a single corpus.
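As an illustration of the kind of extension the description mentions, the following is a minimal Python sketch that copies the lemma of each token's syntactic head into the MISC column of a CoNLL-U sentence. The attribute name `HeadLemma`, the `ROOT` placeholder, the function name `add_head_lemma`, and the sample sentence are all assumptions made for this example; the record does not state how InterCorp names or stores these attributes.

```python
# Sketch only: "HeadLemma" and "ROOT" are hypothetical names chosen for this
# example, not the attribute names actually used in InterCorp.

SAMPLE = """\
# text = Dogs bark loudly.
1\tDogs\tdog\tNOUN\t_\tNumber=Plur\t2\tnsubj\t_\t_
2\tbark\tbark\tVERB\t_\tMood=Ind\t0\troot\t_\t_
3\tloudly\tloudly\tADV\t_\t_\t2\tadvmod\t_\tSpaceAfter=No
4\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_
"""


def add_head_lemma(sentence: str) -> str:
    """Append HeadLemma=<lemma of the syntactic head> to each token's MISC."""
    lines = sentence.splitlines()

    # First pass: map token ID (column 1) to lemma (column 3).
    lemma_by_id = {}
    for line in lines:
        if line and not line.startswith("#"):
            cols = line.split("\t")
            lemma_by_id[cols[0]] = cols[2]

    # Second pass: look up each token's head (column 7) and extend MISC.
    out = []
    for line in lines:
        if not line or line.startswith("#"):
            out.append(line)  # keep comments and blank lines unchanged
            continue
        cols = line.split("\t")
        head = cols[6]
        if head == "_":
            out.append(line)  # multiword-token ranges carry no head
            continue
        head_lemma = "ROOT" if head == "0" else lemma_by_id.get(head)
        if head_lemma is None:
            out.append(line)  # head ID not found; leave the token as-is
            continue
        extra = "HeadLemma=" + head_lemma
        cols[9] = extra if cols[9] == "_" else cols[9] + "|" + extra
        out.append("\t".join(cols))
    return "\n".join(out)


if __name__ == "__main__":
    print(add_head_lemma(SAMPLE))
```

Storing the derived value on the token itself is what makes it searchable in a token-based concordancer, where a query cannot otherwise follow the dependency link to another token.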
Czech name
—
Czech description
—
Classification
Type
Jsc - Article in a specialist periodical, which is included in the SCOPUS database
CEP classification
—
OECD FORD branch
60203 - Linguistics
Result continuities
Project
LM2023044: Czech National Corpus
Continuities
P - Research and development project financed from public funds (with a link to CEP)
Others
Publication year
2023
Confidentiality
S - Complete and truthful data on the project are not subject to protection under special legal regulations
Data specific for result type
Name of the periodical
Jazykovedný Časopis
ISSN
0021-5597
e-ISSN
1338-4287
Volume of the periodical
74
Issue of the periodical within the volume
1
Country of publishing house
SK - SLOVAKIA
Number of pages
12
Pages from-to
254-265
UT code for WoS article
—
EID of the result in the Scopus database
2-s2.0-85181744697