SCTB-V2: the 2nd version of the Chinese treebank in the scientific domain

Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F23%3AP69NL2NX" target="_blank" >RIV/00216208:11320/23:P69NL2NX - isvavai.cz</a>
Result on the web
<a href="https://link.springer.com/10.1007/s10579-022-09615-2" target="_blank" >https://link.springer.com/10.1007/s10579-022-09615-2</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/s10579-022-09615-2" target="_blank" >10.1007/s10579-022-09615-2</a>

Language of the result
English
Title in the original language
SCTB-V2: the 2nd version of the Chinese treebank in the scientific domain
Description of the result in the original language
"Word segmentation, part-of-speech (POS) tagging, and syntactic parsing are three fundamental Chinese analysis tasks for Chinese language processing, which are also crucial for various downstream tasks such as machine translation and information extraction. To achieve high accuracy for these tasks, treebanks that contain sentences manually annotated with word segmentation, part-of-speech tags, and phrase structures are essential. Although there are large-scale Chinese treebanks in the news domain, such treebanks are unavailable in the scientific domain."

Type
J<sub>ost</sub> - Other articles in peer-reviewed periodicals
CEP field
—
OECD FORD field
10201 - Computer sciences, information science, bioinformatics (hardware development to be 2.2, social aspect to be 5.8)

Year of implementation
2023
Data confidentiality code
S - Complete and true data on the project are not subject to protection under special legal regulations
