All

What are you looking for?

All
Projects
Results
Organizations

Quick search

  • Projects supported by TA ČR
  • Excellent projects
  • Projects with the highest public support
  • Current projects

Smart search

  • That is how I find a specific +word
  • That is how I leave the -word out of the results
  • “That is how I can find the whole phrase”

Automatic Phonetic Segmentation and Pronunciation Detection with Various Approaches of Acoustic Modeling

The result's identifiers

  • Result code in IS VaVaI

    <a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F68407700%3A21230%2F18%3A00323958" target="_blank" >RIV/68407700:21230/18:00323958 - isvavai.cz</a>

  • Result on the web

    <a href="http://dx.doi.org/10.1007/978-3-319-99579-3_44" target="_blank" >http://dx.doi.org/10.1007/978-3-319-99579-3_44</a>

  • DOI - Digital Object Identifier

    <a href="http://dx.doi.org/10.1007/978-3-319-99579-3_44" target="_blank" >10.1007/978-3-319-99579-3_44</a>

Alternative languages

  • Result language

    angličtina

  • Original language name

    Automatic Phonetic Segmentation and Pronunciation Detection with Various Approaches of Acoustic Modeling

  • Original language description

    The paper describes HMM-based phonetic segmentation realized by KALDI toolkit with the focus on study of accuracy of various acoustic modeling such as GMM-HMM vs. DNN-HMM, monophone vs. triphone, speaker independent vs. speaker dependent. The analysis was performed with TIMIT database and it proved the contribution of advanced acoustic modeling, especially for the choice of a proper pronunciation variant. For this purpose, the lexicon covering the pronunciation variability among TIMIT speakers was created on the basis of phonetic transcriptions available in TIMIT corpus. When the proper sequence of phones is recognized by DNN-HMM system, more precise boundary placement can be then obtained using basic monophone acoustic models.

  • Czech name

  • Czech description

Classification

  • Type

    D - Article in proceedings

  • CEP classification

  • OECD FORD branch

    10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)

Result continuities

  • Project

  • Continuities

    S - Specificky vyzkum na vysokych skolach

Others

  • Publication year

    2018

  • Confidentiality

    S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Data specific for result type

  • Article name in the collection

    Speech and Computer

  • ISBN

    978-3-319-99578-6

  • ISSN

    0302-9743

  • e-ISSN

  • Number of pages

    11

  • Pages from-to

    419-429

  • Publisher name

    Springer

  • Place of publication

    Basel

  • Event location

    Leipzig

  • Event date

    Sep 18, 2018

  • Type of event by nationality

    WRD - Celosvětová akce

  • UT code for WoS article