Czech Verse Processing System KVĚTA: Phonetic and Metrical Components

Identifikátory výsledku

Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F68378068%3A_____%2F16%3A00467308" target="_blank" >RIV/68378068:_____/16:00467308 - isvavai.cz</a>
Výsledek na webu
—
DOI - Digital Object Identifier
—

Alternativní jazyky

Jazyk výsledku
angličtina
Název v původním jazyce
Czech Verse Processing System KVĚTA: Phonetic and Metrical Components
Popis výsledku v původním jazyce
The following paper describes the algorithms of phonetic and metrical components of the Czech verse processing system KVĚTA, updating information contained in previous reports (Ibrahim and Plecháč 2011; Plecháč et al. 2013a; Ibrahim and Plecháč 2014). The system is being used in the building of the Corpus of Czech Verse (hereinafter CCV), which at present contains 1 689 Czech books of poetry (over 2.5 million lines) from the nineteenth and early twentieth centuries. In contrast to standard language corpora, in each lexical unit are not only the lemma and morphological tag attributes assigned but they also contain a phonetic transcription; furthermore, the attributes metre (iamb, trochee), length (number of feet), ending (feminine, masculine) and metrical patternnare assigned to each verse line. At higher levels rhyme pairs (or n-some) and fixed forms (sonnet, rondel, etc.) are annotated. Here we will focus on components providing phonetic and metrical annotation: (1) the F-component, whose task is to derive the phonetic transcription from the input data, (2) the G-component, whose task is to generate a set of all possible metrical interpretationsnof these data, and (3) the M-component, whose task is to select from this set the final interpretation. Automatic analysis has so far been limited to accentual-syllabic (hereinafter AS) and monometric poems – i. e., poems consisting of repetitions and variations of a single metrical pattern (though AS imitations of some quantitative meters are recognized).
Název v anglickém jazyce
Czech Verse Processing System KVĚTA: Phonetic and Metrical Components
Popis výsledku anglicky
The following paper describes the algorithms of phonetic and metrical components of the Czech verse processing system KVĚTA, updating information contained in previous reports (Ibrahim and Plecháč 2011; Plecháč et al. 2013a; Ibrahim and Plecháč 2014). The system is being used in the building of the Corpus of Czech Verse (hereinafter CCV), which at present contains 1 689 Czech books of poetry (over 2.5 million lines) from the nineteenth and early twentieth centuries. In contrast to standard language corpora, in each lexical unit are not only the lemma and morphological tag attributes assigned but they also contain a phonetic transcription; furthermore, the attributes metre (iamb, trochee), length (number of feet), ending (feminine, masculine) and metrical patternnare assigned to each verse line. At higher levels rhyme pairs (or n-some) and fixed forms (sonnet, rondel, etc.) are annotated. Here we will focus on components providing phonetic and metrical annotation: (1) the F-component, whose task is to derive the phonetic transcription from the input data, (2) the G-component, whose task is to generate a set of all possible metrical interpretationsnof these data, and (3) the M-component, whose task is to select from this set the final interpretation. Automatic analysis has so far been limited to accentual-syllabic (hereinafter AS) and monometric poems – i. e., poems consisting of repetitions and variations of a single metrical pattern (though AS imitations of some quantitative meters are recognized).

Klasifikace

Druh
J<sub>x</sub> - Nezařazeno - Článek v odborném periodiku (Jimp, Jsc a Jost)
CEP obor
AJ - Písemnictví, mas–media, audiovize
OECD FORD obor
—

Návaznosti výsledku

Projekt
—
Návaznosti
I - Institucionalni podpora na dlouhodoby koncepcni rozvoj vyzkumne organizace

Ostatní

Rok uplatnění
2016
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Údaje specifické pro druh výsledku

Název periodika
Glottotheory
ISSN
1337-7892
e-ISSN
—
Svazek periodika
7
Číslo periodika v rámci svazku
2
Stát vydavatele periodika
DE - Spolková republika Německo
Počet stran výsledku
16
Strana od-do
159-174
Kód UT WoS článku
—
EID výsledku v databázi Scopus
2-s2.0-85002152591

Podobné výsledky(10)

Korpus českého verše Automatic Poetic Metre Detection for Czech Verse Úvod do teorie verše

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Czech Verse Processing System KVĚTA: Phonetic and Metrical Components

Identifikátory výsledku

Alternativní jazyky

Klasifikace

Návaznosti výsledku

Ostatní

Údaje specifické pro druh výsledku

Podobné výsledky(10)

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Popis výsledku

Identifikátory výsledku

Identifikátory výsledku

Alternativní jazyky

Alternativní jazyky

Klasifikace

Klasifikace

Návaznosti výsledku

Návaznosti výsledku

Ostatní

Ostatní

Údaje specifické pro druh výsledku

Údaje specifické pro druh výsledku

Podobné výsledky(10)