A new dataset for sentence-level complexity in Russian

Identifikátory výsledku

Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F23%3AV8EWMTSK" target="_blank" >RIV/00216208:11320/23:V8EWMTSK - isvavai.cz</a>
Výsledek na webu
<a href="https://www.dialog-21.ru/media/5900/ivanovvpluselbayoumimg086.pdf" target="_blank" >https://www.dialog-21.ru/media/5900/ivanovvpluselbayoumimg086.pdf</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.28995/2075-7182-2023-22-181-190" target="_blank" >10.28995/2075-7182-2023-22-181-190</a>

Alternativní jazyky

Jazyk výsledku
angličtina
Název v původním jazyce
A new dataset for sentence-level complexity in Russian
Popis výsledku v původním jazyce
"Text complexity prediction is a well-studied task. Predicting complexity sentence-level has attracted less re-nsearch interest in Russian. One possible application of sentence-level complexity prediction is more precise andnfine-grained modeling of text complexity. In the paper we present a novel dataset with sentence-level annotationnof complexity. The dataset is open and contains 1,200 Russian sentences extracted from SynTagRus treebank.nAnnotations were collected via Yandex Toloka platform using 7-point scale. The paper presents various linguisticnfeatures that can contribute to sentence complexity as well as a baseline linear model."
Název v anglickém jazyce
A new dataset for sentence-level complexity in Russian
Popis výsledku anglicky
"Text complexity prediction is a well-studied task. Predicting complexity sentence-level has attracted less re-nsearch interest in Russian. One possible application of sentence-level complexity prediction is more precise andnfine-grained modeling of text complexity. In the paper we present a novel dataset with sentence-level annotationnof complexity. The dataset is open and contains 1,200 Russian sentences extracted from SynTagRus treebank.nAnnotations were collected via Yandex Toloka platform using 7-point scale. The paper presents various linguisticnfeatures that can contribute to sentence complexity as well as a baseline linear model."

Klasifikace

Druh
D - Stať ve sborníku
CEP obor
—
OECD FORD obor
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)

Návaznosti výsledku

Projekt
—
Návaznosti
—

Ostatní

Rok uplatnění
2023
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Údaje specifické pro druh výsledku

Název statě ve sborníku
"Proceedings of the International Conference “Dialogue"
ISBN
—
ISSN
2075-7182
e-ISSN
—
Počet stran výsledku
10
Strana od-do
181-190
Název nakladatele
""
Místo vydání
—
Místo konání akce
Tartu
Datum konání akce
1. 1. 2023
Typ akce podle státní příslušnosti
WRD - Celosvětová akce
Kód UT WoS článku
—

Podobné výsledky(10)

AGRR 2019: Corpus for Gapping Resolution in Russian AGRR 2019: Corpus for Gapping Resolution in Russian Complexity metrics of Russian legal texts: selection, use, initial efficiency evaluation

Co hledáte?

Rychlé hledání

Chytré vyhledávání

A new dataset for sentence-level complexity in Russian

Identifikátory výsledku

Alternativní jazyky

Klasifikace

Návaznosti výsledku

Ostatní

Údaje specifické pro druh výsledku

Podobné výsledky(10)

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Popis výsledku

Identifikátory výsledku

Identifikátory výsledku

Alternativní jazyky

Alternativní jazyky

Klasifikace

Klasifikace

Návaznosti výsledku

Návaznosti výsledku

Ostatní

Ostatní

Údaje specifické pro druh výsledku

Údaje specifické pro druh výsledku

Podobné výsledky(10)