Czech Verse Processing System KVĚTA: Phonetic and Metrical Components
Identifikátory výsledku
Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F68378068%3A_____%2F16%3A00467308" target="_blank" >RIV/68378068:_____/16:00467308 - isvavai.cz</a>
Výsledek na webu
—
DOI - Digital Object Identifier
—
Alternativní jazyky
Jazyk výsledku
angličtina
Název v původním jazyce
Czech Verse Processing System KVĚTA: Phonetic and Metrical Components
Popis výsledku v původním jazyce
The following paper describes the algorithms of phonetic and metrical components of the Czech verse processing system KVĚTA, updating information contained in previous reports (Ibrahim and Plecháč 2011; Plecháč et al. 2013a; Ibrahim and Plecháč 2014). The system is being used in the building of the Corpus of Czech Verse (hereinafter CCV), which at present contains 1 689 Czech books of poetry (over 2.5 million lines) from the nineteenth and early twentieth centuries. In contrast to standard language corpora, in each lexical unit are not only the lemma and morphological tag attributes assigned but they also contain a phonetic transcription; furthermore, the attributes metre (iamb, trochee), length (number of feet), ending (feminine, masculine) and metrical patternnare assigned to each verse line. At higher levels rhyme pairs (or n-some) and fixed forms (sonnet, rondel, etc.) are annotated. Here we will focus on components providing phonetic and metrical annotation: (1) the F-component, whose task is to derive the phonetic transcription from the input data, (2) the G-component, whose task is to generate a set of all possible metrical interpretationsnof these data, and (3) the M-component, whose task is to select from this set the final interpretation. Automatic analysis has so far been limited to accentual-syllabic (hereinafter AS) and monometric poems – i. e., poems consisting of repetitions and variations of a single metrical pattern (though AS imitations of some quantitative meters are recognized).
Název v anglickém jazyce
Czech Verse Processing System KVĚTA: Phonetic and Metrical Components
Popis výsledku anglicky
The following paper describes the algorithms of phonetic and metrical components of the Czech verse processing system KVĚTA, updating information contained in previous reports (Ibrahim and Plecháč 2011; Plecháč et al. 2013a; Ibrahim and Plecháč 2014). The system is being used in the building of the Corpus of Czech Verse (hereinafter CCV), which at present contains 1 689 Czech books of poetry (over 2.5 million lines) from the nineteenth and early twentieth centuries. In contrast to standard language corpora, in each lexical unit are not only the lemma and morphological tag attributes assigned but they also contain a phonetic transcription; furthermore, the attributes metre (iamb, trochee), length (number of feet), ending (feminine, masculine) and metrical patternnare assigned to each verse line. At higher levels rhyme pairs (or n-some) and fixed forms (sonnet, rondel, etc.) are annotated. Here we will focus on components providing phonetic and metrical annotation: (1) the F-component, whose task is to derive the phonetic transcription from the input data, (2) the G-component, whose task is to generate a set of all possible metrical interpretationsnof these data, and (3) the M-component, whose task is to select from this set the final interpretation. Automatic analysis has so far been limited to accentual-syllabic (hereinafter AS) and monometric poems – i. e., poems consisting of repetitions and variations of a single metrical pattern (though AS imitations of some quantitative meters are recognized).
Klasifikace
Druh
J<sub>x</sub> - Nezařazeno - Článek v odborném periodiku (Jimp, Jsc a Jost)
CEP obor
AJ - Písemnictví, mas–media, audiovize
OECD FORD obor
—
Návaznosti výsledku
Projekt
—
Návaznosti
I - Institucionalni podpora na dlouhodoby koncepcni rozvoj vyzkumne organizace
Ostatní
Rok uplatnění
2016
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Údaje specifické pro druh výsledku
Název periodika
Glottotheory
ISSN
1337-7892
e-ISSN
—
Svazek periodika
7
Číslo periodika v rámci svazku
2
Stát vydavatele periodika
DE - Spolková republika Německo
Počet stran výsledku
16
Strana od-do
159-174
Kód UT WoS článku
—
EID výsledku v databázi Scopus
2-s2.0-85002152591