Vícejazyčná automatická detekce strukturálních událostí v mluvené řeči

Název projektu anglicky
Multilingual Automatic Detection of Structural Events in Speech
Anotace anglicky
This project aims to support a closer cooperation between Department of Cybernetics, University of West Bohemia in Pilsen, the Human Language Technology Research Institute, University of Texas at Dallas, and SRI International, in the area of automatic speech understanding. In particular, the project is focused on automatic detection of structural events in speech, which is a key task for enabling downstream automatic processing of automatically recognized text. Structural events include sentence boundaries, disfluencies, and other phenomena that are currently not marked in the ?stream of words? output by conventional speech recognizers. The main goal is to develop methods for automatic detection of structural events in audio documents in different languages, including languages that differ significantly in phonetic, prosodic, and syntactic characteristics. We plan to mainly work on Czech and English, but we also plan to extend the work to Arabic and Mandarin, and eventually to other languages.

Kategorie VaV
ZV - Základní výzkum
CEP - hlavní obor
JD - Využití počítačů, robotika a její aplikace
CEP - vedlejší obor
—
CEP - další vedlejší obor
—
OECD FORD - odpovídající obory <br>(dle <a href="http://www.vyzkum.cz/storage/att/E6EF7938F0E854BAE520AC119FB22E8D/Prevodnik_oboru_Frascati.pdf">převodníku</a>)
20204 - Robotics and automatic control<br>20205 - Automation and control systems

Hodnocení poskytovatelem
V - Vynikající výsledky projektu (s mezinárodním významem atd.)
Zhodnocení výsledků projektu
V projektu byla vytvořena česká řečová databáze s anotací tzv. strukturálních metadat. Byl vyvinut automatický systém pro detekci konce věty v řeči. Úspěšnost systému byla analyzována z pohledu vlivu jazyka, žánru řeči a identity řečníka. Byla rozvíjena?

Důvěrnost údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Systémové označení dodávky dat
CEP10-MSM-ME-U/01:1
Datum dodání záznamu
30. 6. 2010

Cíle projektu