Prak: An automatic phonetic alignment tool for Czech
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11210%2F23%3A10474535" target="_blank" >RIV/00216208:11210/23:10474535 - isvavai.cz</a>
Result on the web
<a href="https://guarant.cz/icphs2023/525.pdf" target="_blank" >https://guarant.cz/icphs2023/525.pdf</a>
DOI - Digital Object Identifier
—
Alternative languages
Result language
English
Original language name
Prak: An automatic phonetic alignment tool for Czech
Original language description
Labeling speech down to the identity and time boundaries of phones is a labor-intensive part of phonetic research. To simplify this work, we created a free, open-source tool that generates phone sequences from Czech text and time-aligns them with audio. Low architectural complexity makes the design approachable for students of phonetics. The acoustic model, a ReLU neural network with 56k weights, was trained using PyTorch on a small amount of CommonVoice data. The alignment and variant-selection decoder is implemented in Python with a matrix library. The Czech pronunciation generator is composed of simple rule-based blocks that capture the logic of the language where possible, allowing details of the transcription approach to be modified. Compared to tools used until now, data preparation efficiency has improved; the tool runs on Mac, Linux and Windows from the Praat GUI or the command line, achieves mostly correct pronunciation variant choice, including glottal stop detection, algorithmically captures most of Czech assimilation logic, and is both didactic and practical.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
60203 - Linguistics
Result continuities
Project
—
Continuities
S - Specific research at universities<br>I - Institutional support for the long-term conceptual development of a research organisation
Others
Publication year
2023
Confidentiality
S - Complete and truthful data on the project are not subject to protection under special legal regulations
Data specific for result type
Article name in the collection
20th International Congress of Phonetic Sciences (ICPhS)
ISBN
978-80-908114-2-3
ISSN
—
e-ISSN
—
Number of pages
5
Pages from-to
3121-3125
Publisher name
Guarant International
Place of publication
Prague, Czech Republic
Event location
Prague Congress Center, Czech Republic
Event date
Aug 7, 2023
Type of event by nationality
WRD - Worldwide event
UT code for WoS article
—