The use of English, Czech and French punctuation marks in Reference, Parallel and Comparable Web Corpora: A Question of Methodology
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11210%2F20%3A10392106" target="_blank" >RIV/00216208:11210/20:10392106 - isvavai.cz</a>
Result on the web
<a href="https://verso.is.cuni.cz/pub/verso.fpl?fname=obd_publikace_handle&handle=NcyjoutD9T" target="_blank" >https://verso.is.cuni.cz/pub/verso.fpl?fname=obd_publikace_handle&handle=NcyjoutD9T</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.14712/18059635.2020.1.2" target="_blank" >10.14712/18059635.2020.1.2</a>
Alternative languages
Result language
English
Original language name
The use of English, Czech and French punctuation marks in Reference, Parallel and Comparable Web Corpora: A Question of Methodology
Original language description
This paper analyses the frequency of six punctuation marks (the comma, period, colon, semicolon, question mark and exclamation mark) in three languages (English, French and Czech) in three different types of corpora: comparable web corpora, large monolingual general (reference) corpora and parallel (translation) corpora. The aim of the analysis is to identify which type of corpus and which methodology are the most suitable for contrastive research into punctuation. The data show that the frequency of different punctuation marks is very sensitive to the text type. Therefore, the web corpora, which contain uncontrollable amounts of various text types, cannot provide specific and reliable information about the use of punctuation marks in a given language. We argue that, despite their limitations in terms of size and composition as well as the potential specific features of the language of translation, the parallel corpora, used in combination with the general (reference) corpora, provide the best data for such research.
Czech name
—
Czech description
—
Classification
Type
J<sub>imp</sub> - Article in a specialist periodical included in the Web of Science database
CEP classification
—
OECD FORD branch
60203 - Linguistics
Result continuities
Project
<a href="/en/project/EF16_019%2F0000734" target="_blank" >EF16_019/0000734: Creativity and Adaptability as Conditions of the Success of Europe in an Interrelated World</a><br>
Continuities
P - Research and development project financed from public funds (with a link to CEP)<br>I - Institutional support for the long-term conceptual development of a research organisation
Others
Publication year
2020
Confidentiality
S - Complete and truthful data on the project are not subject to protection under special legal regulations
Data specific for result type
Name of the periodical
Linguistica Pragensia [online]
ISSN
1805-9635
e-ISSN
—
Volume of the periodical
30
Issue of the periodical within the volume
2
Country of publishing house
CZ - CZECH REPUBLIC
Number of pages
21
Pages from-to
30-50
UT code for WoS article
000534610700002
EID of the result in the Scopus database
2-s2.0-85091179508