Pronunciation Ambiguities in Japanese Kanji
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F23%3AEZFI84TF" target="_blank" >RIV/00216208:11320/23:EZFI84TF - isvavai.cz</a>
Result on the web
<a href="https://aclanthology.org/2023.cawl-1.7/" target="_blank" >https://aclanthology.org/2023.cawl-1.7/</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.18653/v1/2023.cawl-1.7" target="_blank" >10.18653/v1/2023.cawl-1.7</a>
Alternative languages
Result language
angličtina
Original language name
Pronunciation Ambiguities in Japanese Kanji
Original language description
"Japanese writing is a complex system, and a large part of the complexity resides in the use of kanji. A single kanji character in modern Japanese may have multiple pronunciations, either as native vocabulary or as words borrowed from Chinese. This causes a problem for text-to-speech synthesis (TTS) because the system has to predict which pronunciation of each kanji character is appropriate in the context. The problem is called homograph disambiguation. To solve the problem, this research provides a new annotated Japanese single kanji character pronunciation data set and describes an experiment using the logistic regression (LR) classifier. A baseline is computed to compare with the LR classifier accuracy. This experiment provides the first experimental research in Japanese single kanji homograph disambiguation. The annotated Japanese data is freely released to the public to support further work."
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
—
Continuities
—
Others
Publication year
2023
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
"Proceedings of the Workshop on Computation and Written Language"
ISBN
978-1-959429-90-6
ISSN
—
e-ISSN
—
Number of pages
11
Pages from-to
50-60
Publisher name
ACL
Place of publication
Aarhus, Denmark
Event location
Aarhus, Denmark
Event date
Jan 1, 2023
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—