Can Corpus Pattern Analysis Be Used in NLP?
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F10%3A10078001" target="_blank" >RIV/00216208:11320/10:10078001 - isvavai.cz</a>
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
English
Original language name
Can Corpus Pattern Analysis Be Used in NLP?
Original language description
Corpus Pattern Analysis (CPA) [4], coined and implemented by Hanks as the Pattern Dictionary of English Verbs (PDEV) [3], appears to be the only deliberate and consistent implementation of Sinclair's concept of the Lexical Item [12]. In his theoretical inquiries [5], Hanks hypothesizes that the pattern repository produced by CPA can also support the word sense disambiguation task. Although more than 670 verb entries have already been compiled in PDEV, no systematic evaluation of this ambitious project has been reported yet. Assuming that the Sinclairian concept of the Lexical Item is correct, we started to closely examine PDEV with its possible NLP application in mind. The experiments presented in this paper were performed on a pilot sample of English verbs to provide a first reliable view of whether humans can agree in assigning PDEV patterns to verbs in a corpus. In conclusion, we suggest procedures for the future development of PDEV.
Czech name
—
Czech description
—
Classification
Type
J<sub>x</sub> - Unclassified - Peer-reviewed scientific article (Jimp, Jsc and Jost)
CEP classification
AI - Linguistics
OECD FORD branch
—
Result continuities
Project
<a href="/en/project/GAP406%2F10%2F0875" target="_blank" >GAP406/10/0875: Computational Linguistics: Explicit description of language and annotated data focused on Czech</a>
Continuities
P - Research and development project financed from public funds (with a link to CEP)
Z - Research plan (with a link to CEZ)
S - Specific research at universities
Others
Publication year
2010
Confidentiality
S - Complete and true data on the project are not subject to protection under special legal regulations
Data specific for result type
Name of the periodical
Lecture Notes in Computer Science
ISSN
0302-9743
e-ISSN
—
Volume of the periodical
2010
Issue of the periodical within the volume
6231
Country of publishing house
DE - GERMANY
Number of pages
8
Pages from-to
—
UT code for WoS article
—
EID of the result in the Scopus database
—