The Open Cantonese Sense-Tagged Corpus
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F61989592%3A15210%2F23%3A73624811" target="_blank" >RIV/61989592:15210/23:73624811 - isvavai.cz</a>
Result on the web
<a href="https://aclanthology.org/2023.gwc-1.32/" target="_blank" >https://aclanthology.org/2023.gwc-1.32/</a>
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
The Open Cantonese Sense-Tagged Corpus
Original language description
This paper introduces the Open Cantonese Sense-Tagged Corpus, a new and ongoing project to serve as the companion to the development of the Cantonese Wordnet. This corpus is built on top of the Cantonese Wordnet Corpus, which currently provides example sentences for most verbs in this wordnet. This paper motivates the choice of starting a sense-tagged corpus from both linguistic and educational perspectives, and discusses the current solutions to issues arisen from the sensetagging exercise. In total, we have tagged over 5,000 concepts, with more than 3,700 direct links to the Cantonese Wordnet.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
60203 - Linguistics
Result continuities
Project
—
Continuities
I - Institucionalni podpora na dlouhodoby koncepcni rozvoj vyzkumne organizace
Others
Publication year
2023
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Proceedings of the 12th Global Wordnet Conference, pages 263–268, University of the Basque Country, Donostia - San Sebastian, Basque Country
ISBN
978-84-09-53956-7
ISSN
—
e-ISSN
—
Number of pages
6
Pages from-to
263-268
Publisher name
Global Wordnet Association
Place of publication
Donostia-San Sebastian
Event location
Donostia - San Sebastian
Event date
Jan 23, 2023
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—