Using Suprasegmental Information in Recognized Speech Punctuation Completion
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F46747885%3A24220%2F14%3A%230003004" target="_blank" >RIV/46747885:24220/14:#0003004 - isvavai.cz</a>
Result on the web
<a href="http://dx.doi.org/10.1007/978-3-319-10816-2_67" target="_blank" >http://dx.doi.org/10.1007/978-3-319-10816-2_67</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/978-3-319-10816-2_67" target="_blank" >10.1007/978-3-319-10816-2_67</a>
Alternative languages
Result language
angličtina
Original language name
Using Suprasegmental Information in Recognized Speech Punctuation Completion
Original language description
We propose a scheme to determine punctuation of the text produced by an automatic speech recognizer. We deal with the addition of commas based on the recognized text and we propose a full stop detection scheme using both - the textual and prosody information. We also propose an expanded scheme which utilizes enriched audio document information (e.g. speaker diarization, language detection etc.) to improve the sentence boundary detection. We compare the above mentioned schemes and its accuracy in terms of (in)correctly estimated punctuation markers and its ability to mark the positions of sentence boundaries. Hence we want to show it is better to incorporate all the relevant information sources in one reasonable scheme than to split the document processing into independent layers. Proposed schemes are evaluated over a set of recordings from the Czech (and Czechoslovak) radio broadcasts
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
JC - Computer hardware and software
OECD FORD branch
—
Result continuities
Project
<a href="/en/project/DF11P01OVV013" target="_blank" >DF11P01OVV013: Disclosure of the Czech Radio archive for sophisticated search</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2014
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Proc. of 17th International Conference, TSD 2014
ISBN
9783319108155
ISSN
0302-9743
e-ISSN
—
Number of pages
8
Pages from-to
555-562
Publisher name
Springer-Verlag Berlin Heidelberg
Place of publication
Berlín, Spolková republika Německo
Event location
Brno, Česká Republika
Event date
Jan 1, 2014
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—