Formální reprezentace jazykových struktur

Project title in English
Formal representation of language structures
Annotation in English
Natural Language Processing (NLP) of the Czech language is one of the high-priority tasks of Czech linguistics. It presupposes the specification of a formal representation of language structures, which will serve as a target representation for sentence analysis (of running texts, queries, etc.) as well as a source representation for subsequent generation (as part of machine translation systems, abstract generation systems, question answering systems, etc.). Such a representation must be formally correct, nonredundant (as far as possible), and transparent from the interpretation point of view. It must reflect the structural properties of Czech and other Indo-European languages, and be based on empirical language studies. We assume that an unprecedented, large-scale evaluation will take place during the project to ensure proper feedback. NLP is at the centre of interest today. It is therefore necessary to follow developments in the NLP community at large. From the scientific point of

R&D category
—
CEP - main field
AI - Linguistics
CEP - secondary field
AF - Documentation, librarianship, information management
CEP - additional secondary field
BD - Information theory
OECD FORD - corresponding fields (according to the <a href="http://www.vyzkum.cz/storage/att/E6EF7938F0E854BAE520AC119FB22E8D/Prevodnik_oboru_Frascati.pdf">converter</a>)
10102 - Applied mathematics
10201 - Computer sciences, information science, bioinformatics (hardware development to be 2.2, social aspect to be 5.8)
50803 - Information science (social aspects)
50804 - Library science
60201 - General language studies
60202 - Specific languages
60203 - Linguistics

Evaluation by the provider
V - Excellent project results (of international significance, etc.)
Assessment of project results
The project produced the Prague Dependency Treebank (Pražský závislostní korpus), containing 30,000 ordinary Czech sentences annotated at the morphological and analytical levels. The project results are very significant for the further computational processing of Czech. They have international

Data confidentiality
S - Complete and true data on the project are not subject to protection under special legal regulations
System designation of the data delivery
CEP/1999/GA0/GA09GA/V/6:6
Record delivery date
—
