Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot Study
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F16%3A10335455" target="_blank" >RIV/00216208:11320/16:10335455 - isvavai.cz</a>
Alternative codes found
RIV/00216224:14330/16:00090038
Result on the web
<a href="http://www.lrec-conf.org/proceedings/lrec2016/pdf/506_Paper.pdf" target="_blank" >http://www.lrec-conf.org/proceedings/lrec2016/pdf/506_Paper.pdf</a>
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
Graded and Word-Sense-Disambiguation Decisions in Corpus Pattern Analysis: a Pilot Study
Original language description
We present a pilot analysis of a new linguistic resource, VPS-GradeUp (available at http://hdl.handle.net/11234/1-1585). The resource contains 11,400 graded human decisions on usage patterns of 29 English lexical verbs, randomly selected from the Pattern Dictionary of English Verbs (Hanks, 2000 2014) based on their frequency and the number of senses their lemmas have in PDEV. This data set has been created to observe the interannotator agreement on PDEV patterns produced using the Corpus Pattern Analysis (Hanks, 2013). Apart from the graded decisions, the data set also contains traditional Word-Sense-Disambiguation (WSD) labels. We analyze the associations between the graded annotation and WSD annotation. The results of the respective annotations do not correlate with the size of the usage pattern inventory for the respective verbs lemmas, which makes the data set worth further linguistic analysis.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
IN - Informatics
OECD FORD branch
—
Result continuities
Project
Result was created during the realization of more than one project. More information in the Projects tab.
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2016
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016)
ISBN
978-2-9517408-9-1
ISSN
—
e-ISSN
—
Number of pages
7
Pages from-to
848-854
Publisher name
European Language Resources Association
Place of publication
Paris, France
Event location
Portorož, Slovenia
Event date
May 23, 2016
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—