Primary Data Collection for Language Description and Documentation

Identifikátory výsledku

Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F46747885%3A24510%2F20%3A00008478" target="_blank" >RIV/46747885:24510/20:00008478 - isvavai.cz</a>
Výsledek na webu
<a href="https://digilib.phil.muni.cz/handle/11222.digilib/142574" target="_blank" >https://digilib.phil.muni.cz/handle/11222.digilib/142574</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.5817/ERB2020-1-6" target="_blank" >10.5817/ERB2020-1-6</a>

Alternativní jazyky

Jazyk výsledku
španělština
Název v původním jazyce
Recopilación de datos primarios para la descripción y documentación de la lengua
Popis výsledku v původním jazyce
In September 2018 CIDLeS (Centro Interdisciplinar de Documentação Linguística e Social, Minde, Portugal) in cooperation with Technical University of Liberec, Czech Republic, launched a project: MSCA TUL: Community-Driven Documentation and Description of A Fala. The methodology used in the project is based on primary data collection and its usage for both description and documentation purposes. The objective of the paper is to introduce the design of primary data corpus, which is the basis of the whole project. The primary data have variety of forms: audio and video recordings, written texts published or unpublished, existing linguistic resources, and also the data created or collected by the community of speakers. The paper discusses various aspects to be considered while collecting and processing the data. One of these aspects is the balance between the three main varieties of A Fala, Lagarteiru, Mañegu and Valverdeñu. Another aspect to take into account is the selection of topics for the interviews and also the selection of participants, to achieve age and gender balanced sample. In case of written texts copy rights have to be respected and resolved in cases when it is not possible to get the consent from authors or editors. Last but not least, the size of the corpus was also one of the issues to be considered together with the possibility to enlarge the database easily in the future. The paper exposes the experience gained in the course of data collection and also the gap between the ideal solutions and the viable solutions.
Název v anglickém jazyce
Primary Data Collection for Language Description and Documentation
Popis výsledku anglicky
In September 2018 CIDLeS (Centro Interdisciplinar de Documentação Linguística e Social, Minde, Portugal) in cooperation with Technical University of Liberec, Czech Republic, launched a project: MSCA TUL: Community-Driven Documentation and Description of A Fala. The methodology used in the project is based on primary data collection and its usage for both description and documentation purposes. The objective of the paper is to introduce the design of primary data corpus, which is the basis of the whole project. The primary data have variety of forms: audio and video recordings, written texts published or unpublished, existing linguistic resources, and also the data created or collected by the community of speakers. The paper discusses various aspects to be considered while collecting and processing the data. One of these aspects is the balance between the three main varieties of A Fala, Lagarteiru, Mañegu and Valverdeñu. Another aspect to take into account is the selection of topics for the interviews and also the selection of participants, to achieve age and gender balanced sample. In case of written texts copy rights have to be respected and resolved in cases when it is not possible to get the consent from authors or editors. Last but not least, the size of the corpus was also one of the issues to be considered together with the possibility to enlarge the database easily in the future. The paper exposes the experience gained in the course of data collection and also the gap between the ideal solutions and the viable solutions.

Klasifikace

Druh
J<sub>SC</sub> - Článek v periodiku v databázi SCOPUS
CEP obor
—
OECD FORD obor
60203 - Linguistics

Návaznosti výsledku

Projekt
—
Návaznosti
O - Projekt operacniho programu

Ostatní

Rok uplatnění
2020
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Údaje specifické pro druh výsledku

Název periodika
Études romanes de Brno
ISSN
1803-7399
e-ISSN
—
Svazek periodika
41
Číslo periodika v rámci svazku
1
Stát vydavatele periodika
CZ - Česká republika
Počet stran výsledku
12
Strana od-do
87-98
Kód UT WoS článku
—
EID výsledku v databázi Scopus
2-s2.0-85089172023

Podobné výsledky(10)

Experience of the A Fala Documentation and Description Project Driven by the Community of Speakers A Fala Dictionary: lagarteiru, mañegu, valverdeñu Czech Text Document Corpus v 2.0

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Primary Data Collection for Language Description and Documentation

Identifikátory výsledku

Alternativní jazyky

Klasifikace

Návaznosti výsledku

Ostatní

Údaje specifické pro druh výsledku

Podobné výsledky(10)

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Popis výsledku

Identifikátory výsledku

Identifikátory výsledku

Alternativní jazyky

Alternativní jazyky

Klasifikace

Klasifikace

Návaznosti výsledku

Návaznosti výsledku

Ostatní

Ostatní

Údaje specifické pro druh výsledku

Údaje specifické pro druh výsledku

Podobné výsledky(10)