Automatic dialog act corpus creation from web pages
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F10%3A00503990" target="_blank" >RIV/49777513:23520/10:00503990 - isvavai.cz</a>
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
Automatic dialog act corpus creation from web pages
Original language description
This work presents two complementary tools dedicated to the task of textual corpus creation for linguistic researches. The chosen application domain is automatic dialog acts recognition, but the proposed tools might also be applied to any other researcharea that is concerned with dialogs processing. The first software captures relevant dialogs from freely available resources on the World Wide Web. The second software is finally used as a post-processing step to manually check and correct tagging errorswhen needed. We show that reasonably good dialog act labeling accuracy may be achieved, hence greatly reducing the cost of building such corpora.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
IN - Informatics
OECD FORD branch
—
Result continuities
Project
<a href="/en/project/2C06009" target="_blank" >2C06009: Complex knowledge base tools for natural language communication with the semantic web</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2010
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
ICEIS 2010, vol. 5, Human-Computer Interaction
ISBN
978-989-8425-08-9
ISSN
—
e-ISSN
—
Number of pages
6
Pages from-to
198-203
Publisher name
SciTelPress - Science and Technology Publications
Place of publication
Setúbal
Event location
Funchal, Madeira, Portugal
Event date
Jun 8, 2010
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—