Study on Phrases Used for Semi-automatic Text-based Speakers' Names Extraction in the Czech Radio Broadcasts News
The result's identifiers
Result code in IS VaVaI
RIV/46747885:24220/14:#0003003 (https://www.isvavai.cz/riv?ss=detail&h=RIV%2F46747885%3A24220%2F14%3A%230003003)
Result on the web
http://dx.doi.org/10.1007/978-3-319-10816-2_50
DOI - Digital Object Identifier
10.1007/978-3-319-10816-2_50
Alternative languages
Result language
English
Original language name
Study on Phrases Used for Semi-automatic Text-based Speakers' Names Extraction in the Czech Radio Broadcasts News
Original language description
In this paper we introduce a methodology for extending the speaker database used in the automatic transcription of spoken documents stored in the largest Czech Radio audio archive. We address one issue in converting speech to written text: the automatic detection of speakers and their names. We work with a subset of the archive that consists of 8,020 hours of broadcast news and 58,914,179 words from the years 1968-2011. Thousands of speakers' names occur during this period, so their identification has to be automatic or semi-automatic. A further issue we investigate with a view to extending the speaker database is the co-occurrence of a speaker's name within a specific phrase in the text transcription and a speaker change in the audio recording.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
JC - Computer hardware and software
OECD FORD branch
—
Result continuities
Project
DF11P01OVV013: Disclosure of the Czech Radio archive for sophisticated search
Continuities
P - Research and development project financed from public funds (with a link to CEP)
Others
Publication year
2014
Confidentiality
S - Complete and truthful data on the project are not subject to protection under special legal regulations
Data specific for result type
Article name in the collection
Proc. of 17th International Conference, TSD 2014
ISBN
9783319108155
ISSN
0302-9743
e-ISSN
—
Number of pages
8
Pages from-to
416-423
Publisher name
Springer-Verlag Berlin Heidelberg
Place of publication
Berlin, Federal Republic of Germany
Event location
Brno, Czech Republic
Event date
Jan 1, 2014
Type of event by nationality
WRD - Worldwide event
UT code for WoS article
—