Morphologically annotated corpora of Pomak
Identifikátory výsledku
Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F22%3AZ79LJH2I" target="_blank" >RIV/00216208:11320/22:Z79LJH2I - isvavai.cz</a>
Výsledek na webu
<a href="https://aclanthology.org/2022.computel-1.22.pdf" target="_blank" >https://aclanthology.org/2022.computel-1.22.pdf</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.18653/v1/2022.computel-1.22" target="_blank" >10.18653/v1/2022.computel-1.22</a>
Alternativní jazyky
Jazyk výsledku
angličtina
Název v původním jazyce
Morphologically annotated corpora of Pomak
Popis výsledku v původním jazyce
The project Philotis is developing a platform to enable researchers of living languages to easily create and make available state-of-the-art spoken and textual annotated resources. As a case study we use Greek and Pomak, the latter being an endangered oral Slavic language of the Balkans (including Thrace/Greece). The linguistic documentation of Pomak is an ongoing work by an interdisciplinary team in close cooperation with the Pomak community of Greece. We describe our experience in the development of a Latin-based orthography and morphologically annotated text corpora of Pomak with state-of-the-art NLP technology. These resources will be made openly available on the Philotis site and the gold annotated corpora of Pomak will be made available on the Universal Dependencies treebank repository.
Název v anglickém jazyce
Morphologically annotated corpora of Pomak
Popis výsledku anglicky
The project Philotis is developing a platform to enable researchers of living languages to easily create and make available state-of-the-art spoken and textual annotated resources. As a case study we use Greek and Pomak, the latter being an endangered oral Slavic language of the Balkans (including Thrace/Greece). The linguistic documentation of Pomak is an ongoing work by an interdisciplinary team in close cooperation with the Pomak community of Greece. We describe our experience in the development of a Latin-based orthography and morphologically annotated text corpora of Pomak with state-of-the-art NLP technology. These resources will be made openly available on the Philotis site and the gold annotated corpora of Pomak will be made available on the Universal Dependencies treebank repository.
Klasifikace
Druh
O - Ostatní výsledky
CEP obor
—
OECD FORD obor
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Návaznosti výsledku
Projekt
—
Návaznosti
—
Ostatní
Rok uplatnění
2022
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů