Iterative Grapheme-to-Phoneme Alignment for the Training of WFST-based Phonetic Conversion
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F46747885%3A24220%2F13%3A%230002794" target="_blank" >RIV/46747885:24220/13:#0002794 - isvavai.cz</a>
Alternative codes found
RIV/46747885:24220/13:#0002593
Result on the web
<a href="http://dx.doi.org/10.1109/TSP.2013.6613977" target="_blank" >http://dx.doi.org/10.1109/TSP.2013.6613977</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1109/TSP.2013.6613977" target="_blank" >10.1109/TSP.2013.6613977</a>
Alternative languages
Result language
angličtina
Original language name
Iterative Grapheme-to-Phoneme Alignment for the Training of WFST-based Phonetic Conversion
Original language description
In this paper we propose an algorithm for graphemeto-phoneme (G2P) alignment. Such alignment is needed mainly for the data-driven training of G2P conversion tools. Our approach utilizes a given phonetic alphabet and a set of given orthographic-phonetic word pairs as a source of prior knowledge. The development data are taken from a manually created pronunciation lexicon for a large vocabulary speech recognition system for Czech. The alignment method is based on extended Minimum Edit Distance algorithm.Moreover, we propose an approach to avoid the creation of reference alignments ? we evaluate the improvements through a specially designed G2P converter, i.e. we compare the phonetic transcription directly to a set of test orthographic-phonetic word pairs. Results of our approach are comparable or even slightly better than the state-of-the-art.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
JC - Computer hardware and software
OECD FORD branch
—
Result continuities
Project
<a href="/en/project/TA01011204" target="_blank" >TA01011204: Living Archives</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)<br>S - Specificky vyzkum na vysokych skolach
Others
Publication year
2013
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Proc. of 36th International Conference on Telecommunications and Signal Processing (TSP 2013)
ISBN
9781479904044
ISSN
—
e-ISSN
—
Number of pages
5
Pages from-to
474-478
Publisher name
—
Place of publication
—
Event location
Itálie
Event date
Jan 1, 2013
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—