OntoSenseNet: A Verb-Centric Ontological Resource for Indian Languages
Identifikátory výsledku
Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F23%3AFICG884D" target="_blank" >RIV/00216208:11320/23:FICG884D - isvavai.cz</a>
Výsledek na webu
<a href="https://www.scopus.com/inward/record.uri?eid=2-s2.0-85149966896&doi=10.1007%2f978-3-031-23804-8_3&partnerID=40&md5=a36746305770ea60a31e6e51b9d699a0" target="_blank" >https://www.scopus.com/inward/record.uri?eid=2-s2.0-85149966896&doi=10.1007%2f978-3-031-23804-8_3&partnerID=40&md5=a36746305770ea60a31e6e51b9d699a0</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/978-3-031-23804-8_3" target="_blank" >10.1007/978-3-031-23804-8_3</a>
Alternativní jazyky
Jazyk výsledku
angličtina
Název v původním jazyce
OntoSenseNet: A Verb-Centric Ontological Resource for Indian Languages
Popis výsledku v původním jazyce
"Following approaches for understanding lexical meaning developed by Yāska, Patanjali and Bhartrihari from Indian linguistic traditions and extending approaches developed by Leibniz and Brentano in the modern times, a framework of formal ontology of language was developed. This framework proposes that meaning of words are in-formed by intrinsic and extrinsic ontological structures. The paper aims to capture such intrinsic and extrinsic meanings of words for two major Indian languages, namely, Hindi and Telugu. Parts-of-speech have been rendered into sense-types and sense-classes. Using them we have developed a gold-standard annotated lexical resource to support semantic understanding of a language. The resource has collection of Hindi and Telugu lexicons, which has been manually annotated by native speakers of the languages following our annotation guidelines. Further, the resource was utilised to derive adverbial sense-class distribution of verbs and kāraka-verb sense-type distribution. Different corpora (news, novels) were compared using verb sense-types distribution. Word Embedding was used as an aid for the enrichment of the resource. This is a work in progress that aims at lexical coverage of language extensively. © 2023, Springer Nature Switzerland AG."
Název v anglickém jazyce
OntoSenseNet: A Verb-Centric Ontological Resource for Indian Languages
Popis výsledku anglicky
"Following approaches for understanding lexical meaning developed by Yāska, Patanjali and Bhartrihari from Indian linguistic traditions and extending approaches developed by Leibniz and Brentano in the modern times, a framework of formal ontology of language was developed. This framework proposes that meaning of words are in-formed by intrinsic and extrinsic ontological structures. The paper aims to capture such intrinsic and extrinsic meanings of words for two major Indian languages, namely, Hindi and Telugu. Parts-of-speech have been rendered into sense-types and sense-classes. Using them we have developed a gold-standard annotated lexical resource to support semantic understanding of a language. The resource has collection of Hindi and Telugu lexicons, which has been manually annotated by native speakers of the languages following our annotation guidelines. Further, the resource was utilised to derive adverbial sense-class distribution of verbs and kāraka-verb sense-type distribution. Different corpora (news, novels) were compared using verb sense-types distribution. Word Embedding was used as an aid for the enrichment of the resource. This is a work in progress that aims at lexical coverage of language extensively. © 2023, Springer Nature Switzerland AG."
Klasifikace
Druh
D - Stať ve sborníku
CEP obor
—
OECD FORD obor
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Návaznosti výsledku
Projekt
—
Návaznosti
—
Ostatní
Rok uplatnění
2023
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Údaje specifické pro druh výsledku
Název statě ve sborníku
"Lect. Notes Comput. Sci."
ISBN
978-303123803-1
ISSN
0302-9743
e-ISSN
—
Počet stran výsledku
14
Strana od-do
32-45
Název nakladatele
Springer Science and Business Media Deutschland GmbH
Místo vydání
—
Místo konání akce
Melaka, Malaysia
Datum konání akce
1. 1. 2023
Typ akce podle státní příslušnosti
WRD - Celosvětová akce
Kód UT WoS článku
—