Mapping Diatopic and Diachronic Variation in Spoken Czech: the ORTOFON and DIALEKT Corpora
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11210%2F14%3A10289809" target="_blank" >RIV/00216208:11210/14:10289809 - isvavai.cz</a>
Result on the web
<a href="http://www.lrec-conf.org/proceedings/lrec2014/index.html" target="_blank" >http://www.lrec-conf.org/proceedings/lrec2014/index.html</a>
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
Mapping Diatopic and Diachronic Variation in Spoken Czech: the ORTOFON and DIALEKT Corpora
Original language description
"ORTOFON and DIALEKT are two corpora of spoken Czech (recordings + transcripts) which are currently being built at the Institute of the Czech National Corpus. The first one (ORTOFON) continues the tradition of the CNC's ORAL series of spoken corpora by focusing on collecting recordings of unscripted informal spoken interactions (""prototypically spoken texts""), but also provides new features, most notably an annotation scheme with multiple tiers per speaker, including orthographic and phonetic transcripts and allowing for a more precise treatment of overlapping speech. Rich speaker- and situation-related metadata are also collected for possible use as factors in sociolinguistic analyses. One of the stated goals is to make the data in the corpus balanced with respect to a subset of these. The second project, DIALEKT, consists in annotating (in a way partially compatible with the ORTOFON corpus) and providing electronic access to historical (1960s--80s) dialect recordings, mainly of a m
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
AI - Linguistics
OECD FORD branch
—
Result continuities
Project
<a href="/en/project/LM2011023" target="_blank" >LM2011023: Czech National Corpus</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2014
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC 2014)
ISBN
978-2-9517408-8-4
ISSN
—
e-ISSN
—
Number of pages
7
Pages from-to
376-382
Publisher name
European Language Resources Association
Place of publication
Reykjavík, Iceland
Event location
Reykjavík, Iceland
Event date
May 26, 2014
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—