Thematic Concentration and Vocabulary Richness
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F61988987%3A17250%2F16%3AA1701L8Y" target="_blank" >RIV/61988987:17250/16:A1701L8Y - isvavai.cz</a>
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
English
Original language name
Thematic Concentration and Vocabulary Richness
Original language description
The contribution investigates the relation between two stylometric features that have shown promising results in text classification: thematic concentration and vocabulary richness. Specifically, secondary thematic concentration (STC), moving average type-token ratio (MATTR), and repeat rate (RRMC) are analysed. The main aim is to test the hypothesis that vocabulary richness correlates negatively with thematic concentration. The research is based on a corpus of more than 900 English texts from various genres. The study follows up on a similar analysis (Čech 2016) that investigated Czech texts.
Czech name
—
Czech description
—
Classification
Type
C - Chapter in a specialist book
CEP classification
AI - Linguistics
OECD FORD branch
—
Result continuities
Project
—
Continuities
I - Institutional support for the long-term conceptual development of a research organisation
Others
Publication year
2016
Confidentiality
S - Complete and true data on the project are not subject to protection under special legal regulations
Data specific for result type
Book/collection name
Issues in Quantitative Linguistics 4
ISBN
978-3-942303-44-6
Number of pages of the result
10
Pages from-to
150-159
Number of pages of the book
300
Publisher name
RAM-Verlag
Place of publication
Lüdenscheid
UT code for WoS chapter
—