Vše

Co hledáte?

Vše
Projekty
Výsledky výzkumu
Subjekty

Rychlé hledání

  • Projekty podpořené TA ČR
  • Významné projekty
  • Projekty s nejvyšší státní podporou
  • Aktuálně běžící projekty

Chytré vyhledávání

  • Takto najdu konkrétní +slovo
  • Takto z výsledků -slovo zcela vynechám
  • “Takto můžu najít celou frázi”

A Neural Approach to Discourse Relation Signal Detection

Identifikátory výsledku

  • Kód výsledku v IS VaVaI

    <a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F20%3A10426929" target="_blank" >RIV/00216208:11320/20:10426929 - isvavai.cz</a>

  • Výsledek na webu

    <a href="https://journals.uic.edu/ojs/index.php/dad/article/view/11372" target="_blank" >https://journals.uic.edu/ojs/index.php/dad/article/view/11372</a>

  • DOI - Digital Object Identifier

Alternativní jazyky

  • Jazyk výsledku

    angličtina

  • Název v původním jazyce

    A Neural Approach to Discourse Relation Signal Detection

  • Popis výsledku v původním jazyce

    Previous data-driven work investigating the types and distributions of discourse relation signals, including discourse markers such as &apos;however&apos; or phrases such as &apos;as a result&apos; has focused on the relative frequencies of signal words within and outside text from each discourse relation. Such approaches do not allow us to quantify the signaling strength of individual instances of a signal on a scale (e.g. more or less discourse-relevant instances of &apos;and&apos;), to assess the distribution of ambiguity for signals, or to identify words that hinder discourse relation identification in context (&apos;anti-signals&apos; or &apos;distractors&apos;). In this paper we present a data-driven approach to signal detection using a distantly supervised neural network and develop a metric, Δs (or &apos;delta-softmax&apos;), to quantify signaling strength. Ranging between -1 and 1 and relying on recent advances in contextualized words embeddings, the metric represents each word&apos;s positive or negative contribution to the identifiability of a relation in specific instances in context. Based on an English corpus annotated for discourse relations using Rhetorical Structure Theory and signal type annotations anchored to specific tokens, our analysis examines the reliability of the metric, the places where it overlaps with and differs from human judgments, and the implications for identifying features that neural models may need in order to perform better on automatic discourse relation classification.

  • Název v anglickém jazyce

    A Neural Approach to Discourse Relation Signal Detection

  • Popis výsledku anglicky

    Previous data-driven work investigating the types and distributions of discourse relation signals, including discourse markers such as &apos;however&apos; or phrases such as &apos;as a result&apos; has focused on the relative frequencies of signal words within and outside text from each discourse relation. Such approaches do not allow us to quantify the signaling strength of individual instances of a signal on a scale (e.g. more or less discourse-relevant instances of &apos;and&apos;), to assess the distribution of ambiguity for signals, or to identify words that hinder discourse relation identification in context (&apos;anti-signals&apos; or &apos;distractors&apos;). In this paper we present a data-driven approach to signal detection using a distantly supervised neural network and develop a metric, Δs (or &apos;delta-softmax&apos;), to quantify signaling strength. Ranging between -1 and 1 and relying on recent advances in contextualized words embeddings, the metric represents each word&apos;s positive or negative contribution to the identifiability of a relation in specific instances in context. Based on an English corpus annotated for discourse relations using Rhetorical Structure Theory and signal type annotations anchored to specific tokens, our analysis examines the reliability of the metric, the places where it overlaps with and differs from human judgments, and the implications for identifying features that neural models may need in order to perform better on automatic discourse relation classification.

Klasifikace

  • Druh

    O - Ostatní výsledky

  • CEP obor

  • OECD FORD obor

    10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)

Návaznosti výsledku

  • Projekt

  • Návaznosti

Ostatní

  • Rok uplatnění

    2020

  • Kód důvěrnosti údajů

    S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů