All

What are you looking for?

All
Projects
Results
Organizations

Quick search

  • Projects supported by TA ČR
  • Excellent projects
  • Projects with the highest public support
  • Current projects

Smart search

  • That is how I find a specific +word
  • That is how I leave the -word out of the results
  • “That is how I can find the whole phrase”

Automatic dialog act corpus creation from web pages

The result's identifiers

  • Result code in IS VaVaI

    <a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F10%3A00503990" target="_blank" >RIV/49777513:23520/10:00503990 - isvavai.cz</a>

  • Result on the web

  • DOI - Digital Object Identifier

Alternative languages

  • Result language

    angličtina

  • Original language name

    Automatic dialog act corpus creation from web pages

  • Original language description

    This work presents two complementary tools dedicated to the task of textual corpus creation for linguistic researches. The chosen application domain is automatic dialog acts recognition, but the proposed tools might also be applied to any other researcharea that is concerned with dialogs processing. The first software captures relevant dialogs from freely available resources on the World Wide Web. The second software is finally used as a post-processing step to manually check and correct tagging errorswhen needed. We show that reasonably good dialog act labeling accuracy may be achieved, hence greatly reducing the cost of building such corpora.

  • Czech name

  • Czech description

Classification

  • Type

    D - Article in proceedings

  • CEP classification

    IN - Informatics

  • OECD FORD branch

Result continuities

  • Project

    <a href="/en/project/2C06009" target="_blank" >2C06009: Complex knowledge base tools for natural language communication with the semantic web</a><br>

  • Continuities

    P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)

Others

  • Publication year

    2010

  • Confidentiality

    S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Data specific for result type

  • Article name in the collection

    ICEIS 2010, vol. 5, Human-Computer Interaction

  • ISBN

    978-989-8425-08-9

  • ISSN

  • e-ISSN

  • Number of pages

    6

  • Pages from-to

    198-203

  • Publisher name

    SciTelPress - Science and Technology Publications

  • Place of publication

    Setúbal

  • Event location

    Funchal, Madeira, Portugal

  • Event date

    Jun 8, 2010

  • Type of event by nationality

    WRD - Celosvětová akce

  • UT code for WoS article