Digitisation and Automatic Alignment of the DIALOG Corpus: Prosodically Annotated Corpus of Czech Television Debates
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F07%3A10088959" target="_blank" >RIV/00216208:11320/07:10088959 - isvavai.cz</a>
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
Digitisation and Automatic Alignment of the DIALOG Corpus: Prosodically Annotated Corpus of Czech Television Debates
Original language description
This article describes the development and automatic processing of the audio-visual DIALOG corpus. The DIALOG corpus is a prosodically annotated corpus of Czech television debates that has been recorded and annotated at the Czech Language Institute of the Academy of Sciences of the Czech Republic. It has recently grown to more than 400 VHS 4-hour tapes and 375 transcribed TV debates. The described digitisation process and automatic alignment enable an easily accessible and user-friendly research environment, supporting the exploration of Czech prosody and its analysis and modelling. This project has been carried out in cooperation with the Institute of Formal and Applied Linguistics of Faculty of Mathematics and Physics, Charles University, Prague. Currently the first version of the DIALOG corpus is available to the public (version 0.1, http://ujc.dialogy.cz). It includes 10 selected and revised hour-long talk shows.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
AI - Linguistics
OECD FORD branch
—
Result continuities
Project
Result was created during the realization of more than one project. More information in the Projects tab.
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)<br>Z - Vyzkumny zamer (s odkazem do CEZ)
Others
Publication year
2007
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Proceedings of the 10th International Conference on Text, Speech and Dialogue
ISBN
978-3-540-74627-0
ISSN
—
e-ISSN
—
Number of pages
6
Pages from-to
—
Publisher name
Springer
Place of publication
Berlin / Heidelberg
Event location
Pilsen
Event date
Sep 3, 2007
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—