Revisiting VMWEs in Hindi: Annotating Layers of Predication
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F25%3ACKE944LC" target="_blank" >RIV/00216208:11320/25:CKE944LC - isvavai.cz</a>
Result on the web
<a href="https://www.scopus.com/inward/record.uri?eid=2-s2.0-85195167805&partnerID=40&md5=d306023266b50775f9301522992b0057" target="_blank" >https://www.scopus.com/inward/record.uri?eid=2-s2.0-85195167805&partnerID=40&md5=d306023266b50775f9301522992b0057</a>
DOI - Digital Object Identifier
—
Alternative languages
Result language
English
Original language name
Revisiting VMWEs in Hindi: Annotating Layers of Predication
Original language description
Multiword expressions in languages like Hindi are both productive and challenging. Hindi not only uses a variety of verbal multiword expressions (VMWEs) but also employs different combinatorial strategies to create new types of multiword expressions. In this paper, we investigate two such strategies that are common in the language. First, we show that VMWEs in Hindi are not just lexical but also morphological: causatives are formed morphologically in Hindi. Second, we examine stacked VMWEs, i.e. cases where at least two VMWEs occur together. We suggest that the existing PARSEME annotation framework can be extended to these two phenomena without changing the existing guidelines. We also propose rule-based heuristics that use existing Universal Dependencies annotations to automatically identify and annotate some of the VMWEs in the language. The goal of this paper is to refine the existing PARSEME corpus of Hindi for VMWEs while expanding its scope, giving a more comprehensive picture of VMWEs in Hindi. © European Language Resources Association: CC BY-NC 4.0.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformatics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
—
Continuities
—
Others
Publication year
2024
Confidentiality
S - Complete and true data on the project are not subject to protection under special legal regulations
Data specific for result type
Article name in the collection
Jt. Workshop Multiword Expressions Univers. Depend., MWE-UD LREC-COLING - Workshop Proc.
ISBN
978-249381420-3
ISSN
—
e-ISSN
—
Number of pages
8
Pages from-to
98-105
Publisher name
European Language Resources Association (ELRA)
Place of publication
—
Event location
Torino, Italia
Event date
Jan 1, 2025
Type of event by nationality
WRD - Worldwide event
UT code for WoS article
—