Influence of Treebank Design on Representation of Multiword Expressions
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F11%3A10107803" target="_blank" >RIV/00216208:11320/11:10107803 - isvavai.cz</a>
Result on the web
<a href="http://dx.doi.org/10.1007/978-3-642-19400-9" target="_blank" >http://dx.doi.org/10.1007/978-3-642-19400-9</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/978-3-642-19400-9" target="_blank" >10.1007/978-3-642-19400-9</a>
Alternative languages
Result language
angličtina
Original language name
Influence of Treebank Design on Representation of Multiword Expressions
Original language description
Multiword Expressions (MWEs) are important linguistic units that require special treatment in many NLP applications. It is thus desirable to be able to recognize them automatically. Semantically annotated corpora should mark MWEs in a clear way that facilitates development of automatic recognition tools. In the present paper we discuss various corpus design decisions from this perspective. We propose guidelines that should lead to MWE-friendly annotation and evaluate them on numerous sentence examples.Our experience of identifying MWEs in the Prague Dependency Treebank provides the base for the discussion and examples from other languages are added whenever appropriate.
Czech name
—
Czech description
—
Classification
Type
J<sub>x</sub> - Unclassified - Peer-reviewed scientific article (Jimp, Jsc and Jost)
CEP classification
AI - Linguistics
OECD FORD branch
—
Result continuities
Project
Result was created during the realization of more than one project. More information in the Projects tab.
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)<br>Z - Vyzkumny zamer (s odkazem do CEZ)<br>S - Specificky vyzkum na vysokych skolach
Others
Publication year
2011
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
Lecture Notes in Computer Science
ISSN
0302-9743
e-ISSN
—
Volume of the periodical
6608
Issue of the periodical within the volume
1
Country of publishing house
DE - GERMANY
Number of pages
14
Pages from-to
1-14
UT code for WoS article
—
EID of the result in the Scopus database
—