Language Resources for Intelligent Processing of Dialogues about Electrical Networks
The result's identifiers
Result code in IS VaVaI
RIV/00216224:14330/06:00015281 - https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216224%3A14330%2F06%3A00015281
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
English
Original language name
Language Resources for Intelligent Processing of Dialogues about Electrical Networks
Original language description
The paper describes the process of designing a natural language dialogue interface for querying large databases with time data about electrical power network failures. The first stage of implementing such a dialogue interface consists of creating and preparing several auxiliary resources that are required for natural language processing of texts in this specific domain. All modern methods of automatic input analysis of texts covering a domain with special terminology are based on a large collection of texts from the field, a so-called textual corpus. We describe the process and statistical results of creating a corpus of electrical power network texts consisting of more than 100,000 positions (words and marks). We also offer some preliminary results of syntactic analysis of these texts. In the last part of this paper, we present the design of a dialogue system based on analysis techniques using the corpus data that will allow natural language queries (in
Czech name
Jazykové zdroje pro inteligentní zpracování dialogů o elektrických sítích
Czech description
The paper describes the process of designing a natural language dialogue interface for querying large databases with time data about electrical power network failures. The first stage of implementing such a dialogue interface consists of creating and preparing several auxiliary resources that are required for natural language processing of texts in this specific domain. All modern methods of automatic input analysis of texts covering a domain with special terminology are based on a large collection of texts from the field, a so-called textual corpus. We describe the process and statistical results of creating a corpus of electrical power network texts consisting of more than 100,000 positions (words and marks). We also offer some preliminary results of syntactic analysis of these texts. In the last part of this paper, we present the design of a dialogue system based on analysis techniques using the corpus data that will allow natural language queries (in
Classification
Type
D - Article in proceedings
CEP classification
IN - Informatics
OECD FORD branch
—
Result continuities
Project
1ET100300414: Intelligent methods for increasing of reliability of electrical networks
Continuities
P - Research and development project financed from public funds (with a link to CEP)
Others
Publication year
2006
Confidentiality
S - Complete and true data on the project are not subject to protection under special legal regulations
Data specific for result type
Article name in the collection
Proceedings of ElNet 2005
ISBN
80-248-0975-3
ISSN
—
e-ISSN
—
Number of pages
7
Pages from-to
42-49
Publisher name
VŠB TU Ostrava
Place of publication
Ostrava
Event location
Ostrava
Event date
Dec 14, 2005
Type of event by nationality
CST - National event
UT code for WoS article
—