Speech Technology for Unwritten Languages
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216305%3A26230%2F20%3APU140040" target="_blank" >RIV/00216305:26230/20:PU140040 - isvavai.cz</a>
Result on the web
<a href="https://ieeexplore.ieee.org/document/8998182" target="_blank" >https://ieeexplore.ieee.org/document/8998182</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1109/TASLP.2020.2973896" target="_blank" >10.1109/TASLP.2020.2973896</a>
Alternative languages
Result language
angličtina
Original language name
Speech Technology for Unwritten Languages
Original language description
Abstract-Speech technology plays an important role in our everyday life. Among others, speech is used for human-computer interaction, for instance for information retrieval and on-line shopping. In the case of an unwritten language, however, speech technology is unfortunately difficult to create, because it cannot be created by the standard combination of pre-trained speech-to-text and text-to-speech subsystems. The research presented in this article takes the first steps towards speech technology for unwritten languages. Specifically, the aim of this work was 1) to learn speech-to-meaning representations without using text as an intermediate representation, and 2) to test the sufficiency of the learned representations to regenerate speech or translated text, or to retrieve images that depict the meaning of an utterance in an unwritten language. The results suggest that building systems that go directly from speech-to-meaning and from meaning-to-speech, bypassing the need for text, is possible.
Czech name
—
Czech description
—
Classification
Type
J<sub>imp</sub> - Article in a specialist periodical, which is included in the Web of Science database
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
—
Continuities
S - Specificky vyzkum na vysokych skolach
Others
Publication year
2020
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING
ISSN
2329-9290
e-ISSN
2329-9304
Volume of the periodical
2020
Issue of the periodical within the volume
28
Country of publishing house
US - UNITED STATES
Number of pages
12
Pages from-to
964-975
UT code for WoS article
000522357500002
EID of the result in the Scopus database
2-s2.0-85079642575