Can Corpus Pattern Analysis Be Used in NLP?
Identifikátory výsledku
Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F10%3A10078001" target="_blank" >RIV/00216208:11320/10:10078001 - isvavai.cz</a>
Výsledek na webu
—
DOI - Digital Object Identifier
—
Alternativní jazyky
Jazyk výsledku
angličtina
Název v původním jazyce
Can Corpus Pattern Analysis Be Used in NLP?
Popis výsledku v původním jazyce
Corpus Pattern Analysis (CPA) [4], coined and implemented by Hanks as the Pattern Dictionary of English Verbs (PDEV) [3], appears to be the only deliberate and consistent implementation of Sinclair's concept of Lexical Item [12]. In his theoretical inquiries [5] Hanks hypothesizes that the pattern repository produced by CPA can also support the word sense disambiguation task. Although more than 670 verb entries have already been compiled in PDEV, no systematic evaluation of this ambitious project has been reported yet. Assuming that the Sinclairian concept of the Lexical Item is correct, we started to closely examine PDEV with its possible NLP application in mind. Our experiments presented in this paper have been performed on a pilot sample of Englishverbs to provide a first reliable view on whether humans can agree in assigning PDEV patterns to verbs in a corpus. As a conclusion we suggest procedures for future development of PDEV.
Název v anglickém jazyce
Can Corpus Pattern Analysis Be Used in NLP?
Popis výsledku anglicky
Corpus Pattern Analysis (CPA) [4], coined and implemented by Hanks as the Pattern Dictionary of English Verbs (PDEV) [3], appears to be the only deliberate and consistent implementation of Sinclair's concept of Lexical Item [12]. In his theoretical inquiries [5] Hanks hypothesizes that the pattern repository produced by CPA can also support the word sense disambiguation task. Although more than 670 verb entries have already been compiled in PDEV, no systematic evaluation of this ambitious project has been reported yet. Assuming that the Sinclairian concept of the Lexical Item is correct, we started to closely examine PDEV with its possible NLP application in mind. Our experiments presented in this paper have been performed on a pilot sample of Englishverbs to provide a first reliable view on whether humans can agree in assigning PDEV patterns to verbs in a corpus. As a conclusion we suggest procedures for future development of PDEV.
Klasifikace
Druh
J<sub>x</sub> - Nezařazeno - Článek v odborném periodiku (Jimp, Jsc a Jost)
CEP obor
AI - Jazykověda
OECD FORD obor
—
Návaznosti výsledku
Projekt
<a href="/cs/project/GAP406%2F10%2F0875" target="_blank" >GAP406/10/0875: Komputační lingvistika: Explicitní popis jazyka a anotovaná data se zřetelem na češtinu</a><br>
Návaznosti
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)<br>Z - Vyzkumny zamer (s odkazem do CEZ)<br>S - Specificky vyzkum na vysokych skolach
Ostatní
Rok uplatnění
2010
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Údaje specifické pro druh výsledku
Název periodika
Lecture Notes in Computer Science
ISSN
0302-9743
e-ISSN
—
Svazek periodika
2010
Číslo periodika v rámci svazku
6231
Stát vydavatele periodika
DE - Spolková republika Německo
Počet stran výsledku
8
Strana od-do
—
Kód UT WoS článku
—
EID výsledku v databázi Scopus
—