CapekDraCor: A New Contribution to the European Programmable Drama Corpora
Identifikátory výsledku
Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F61989592%3A15210%2F23%3A73620259" target="_blank" >RIV/61989592:15210/23:73620259 - isvavai.cz</a>
Výsledek na webu
<a href="https://sciendo.com/article/10.2478/jazcas-2023-0042" target="_blank" >https://sciendo.com/article/10.2478/jazcas-2023-0042</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.2478/jazcas-2023-0042" target="_blank" >10.2478/jazcas-2023-0042</a>
Alternativní jazyky
Jazyk výsledku
angličtina
Název v původním jazyce
CapekDraCor: A New Contribution to the European Programmable Drama Corpora
Popis výsledku v původním jazyce
The aim of this paper is to present the new CapekDraCor corpus and the DraCor project with its research-oriented concept of a programmable corpora focused on quantitative analyses within the framework of computational literary studies. This presentation demonstrates the way the data are processed with respect to their specific multi-layered structure. The corpus contains all the plays written by Karel and Josef Čapek and the data are processed in a standardized format based on XML and general TEI guidelines for processing drama with a defined basic drama tagset. CapekDraCor also uses the newly created EZdrama format for data processing, which works as an intermediate step from .txt to .xml file as a lightweight YAML-like markup language. A file in this format can be automatically converted into a DraCor-ready XML file with a TEI header. The DraCor digital platform extends the possibilities of large-scale drama analysis with a focus on the dramatic character(s). The basic operationalisation is the interaction within a dramatic configuration, i.e., the scenic co-presence of two speakers, from which network data are automatically extracted, both global networks of interactions of dramas and data characterising individual actors, i.e., literary characters.
Název v anglickém jazyce
CapekDraCor: A New Contribution to the European Programmable Drama Corpora
Popis výsledku anglicky
The aim of this paper is to present the new CapekDraCor corpus and the DraCor project with its research-oriented concept of a programmable corpora focused on quantitative analyses within the framework of computational literary studies. This presentation demonstrates the way the data are processed with respect to their specific multi-layered structure. The corpus contains all the plays written by Karel and Josef Čapek and the data are processed in a standardized format based on XML and general TEI guidelines for processing drama with a defined basic drama tagset. CapekDraCor also uses the newly created EZdrama format for data processing, which works as an intermediate step from .txt to .xml file as a lightweight YAML-like markup language. A file in this format can be automatically converted into a DraCor-ready XML file with a TEI header. The DraCor digital platform extends the possibilities of large-scale drama analysis with a focus on the dramatic character(s). The basic operationalisation is the interaction within a dramatic configuration, i.e., the scenic co-presence of two speakers, from which network data are automatically extracted, both global networks of interactions of dramas and data characterising individual actors, i.e., literary characters.
Klasifikace
Druh
J<sub>SC</sub> - Článek v periodiku v databázi SCOPUS
CEP obor
—
OECD FORD obor
60203 - Linguistics
Návaznosti výsledku
Projekt
—
Návaznosti
I - Institucionalni podpora na dlouhodoby koncepcni rozvoj vyzkumne organizace
Ostatní
Rok uplatnění
2023
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Údaje specifické pro druh výsledku
Název periodika
Jazykovedny Casopis
ISSN
0021-5597
e-ISSN
1338-4287
Svazek periodika
74
Číslo periodika v rámci svazku
1
Stát vydavatele periodika
SK - Slovenská republika
Počet stran výsledku
10
Strana od-do
244-253
Kód UT WoS článku
—
EID výsledku v databázi Scopus
2-s2.0-85181718767