Primary Data Collection for Language Description and Documentation
Result identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F46747885%3A24510%2F20%3A00008478" target="_blank" >RIV/46747885:24510/20:00008478 - isvavai.cz</a>
Result on the web
<a href="https://digilib.phil.muni.cz/handle/11222.digilib/142574" target="_blank" >https://digilib.phil.muni.cz/handle/11222.digilib/142574</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.5817/ERB2020-1-6" target="_blank" >10.5817/ERB2020-1-6</a>
Alternative languages
Result language
Spanish
Title in the original language
Recopilación de datos primarios para la descripción y documentación de la lengua
Result description in the original language
In September 2018, CIDLeS (Centro Interdisciplinar de Documentação Linguística e Social, Minde, Portugal), in cooperation with the Technical University of Liberec, Czech Republic, launched the project MSCA TUL: Community-Driven Documentation and Description of A Fala. The project's methodology is based on primary data collection and the use of these data for both description and documentation. The objective of the paper is to introduce the design of the primary data corpus, which is the basis of the whole project. The primary data take a variety of forms: audio and video recordings, published and unpublished written texts, existing linguistic resources, and data created or collected by the community of speakers. The paper discusses various aspects to be considered when collecting and processing the data. One of these is the balance between the three main varieties of A Fala: Lagarteiru, Mañegu and Valverdeñu. Another is the selection of interview topics and of participants, so as to achieve an age- and gender-balanced sample. In the case of written texts, copyright has to be respected, and a solution has to be found where it is not possible to obtain consent from authors or editors. Last but not least, the size of the corpus also had to be considered, together with the possibility of easily enlarging the database in the future. The paper presents the experience gained in the course of data collection, as well as the gap between ideal and viable solutions.
Title in English
Primary Data Collection for Language Description and Documentation
Result description in English
In September 2018, CIDLeS (Centro Interdisciplinar de Documentação Linguística e Social, Minde, Portugal), in cooperation with the Technical University of Liberec, Czech Republic, launched the project MSCA TUL: Community-Driven Documentation and Description of A Fala. The project's methodology is based on primary data collection and the use of these data for both description and documentation. The objective of the paper is to introduce the design of the primary data corpus, which is the basis of the whole project. The primary data take a variety of forms: audio and video recordings, published and unpublished written texts, existing linguistic resources, and data created or collected by the community of speakers. The paper discusses various aspects to be considered when collecting and processing the data. One of these is the balance between the three main varieties of A Fala: Lagarteiru, Mañegu and Valverdeñu. Another is the selection of interview topics and of participants, so as to achieve an age- and gender-balanced sample. In the case of written texts, copyright has to be respected, and a solution has to be found where it is not possible to obtain consent from authors or editors. Last but not least, the size of the corpus also had to be considered, together with the possibility of easily enlarging the database in the future. The paper presents the experience gained in the course of data collection, as well as the gap between ideal and viable solutions.
Classification
Type
J<sub>SC</sub> - Article in a periodical indexed in the SCOPUS database
CEP field
—
OECD FORD field
60203 - Linguistics
Result linkages
Project
—
Linkages
O - Operational programme project
Other
Year of implementation
2020
Data confidentiality code
S - Complete and true data on the project are not subject to protection under special legal regulations
Data specific to the result type
Name of the periodical
Études romanes de Brno
ISSN
1803-7399
e-ISSN
—
Volume of the periodical
41
Issue of the periodical within the volume
1
Country of the periodical's publisher
CZ - Czech Republic
Number of pages of the result
12
Pages from-to
87-98
UT WoS code of the article
—
EID of the result in the Scopus database
2-s2.0-85089172023