The role of small corpora in linguistic research

Identifikátory výsledku

Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F61988987%3A17250%2F13%3AA14019F3" target="_blank" >RIV/61988987:17250/13:A14019F3 - isvavai.cz</a>
Výsledek na webu
—
DOI - Digital Object Identifier
—

Alternativní jazyky

Jazyk výsledku
němčina
Název v původním jazyce
Zur Rolle von Kleinkorpora in der linguistischen Forschung
Popis výsledku v původním jazyce
Im Kapitel wird der Frage nachgegangen, welchen Beitrag kleine Textkorpora (Textsammlungen im Umfang bis zu einer Million Tokens) neben den großen Korpora leisten können und für welchen Forschungszweck sie geeignet sind. In diesem Zusammenhang werden einige Probleme und Defizite in Hinsicht auf die Vertretung der einzelnen Kommunikationsbereiche und Textsorten in den bereits bestehenden großen Korpora, wie etwa im IDS-Korpus, erwähnt. Des Weiteren werden am Beispiel eines Kleinkorpus mit Wirtschaftstexten verschiedene Kriterien behandelt, die bei der Erstellung von Kleinkorpora in Betracht zu ziehen sind. Das Augenmerk wird dabei auf die Bearbeitung von Primär- und Metadaten gerichtet. Im letzten Teil des Kapitels werden einige frühere Ergebnisse der Analyse des kleinen Wirtschaftskorpus auf ihre Gültigkeit in Bezug auf das IDS-Korpus überprüft.
Název v anglickém jazyce
The role of small corpora in linguistic research
Popis výsledku anglicky
This chapter addresses the question of the role played by small corpora in linguistic research (i.e. corpora with up to one million tokens) as well as large corpora, assessing the purposes for which they are suitable. The chapter mentions a number of issues and deficiencies of such corpora, including the uneven representation, in large corpora such as IDS, of various types of texts (with various text functions) from different fields of professional communication. Using the example of a small corpus of texts on economic topics, the chapter outlines a range of criteria that must be taken into account when compiling such corpora. It also addresses issues related to the processing of primary data and metadata using various software tools. The final sectionfocuses on the question of the validity of research results obtained from small corpora and offers the possibility of analysis using the large IDS corpus in order to verify results from small corpora.

Klasifikace

Druh
C - Kapitola v odborné knize
CEP obor
AI - Jazykověda
OECD FORD obor
—

Návaznosti výsledku

Projekt
—
Návaznosti
I - Institucionalni podpora na dlouhodoby koncepcni rozvoj vyzkumne organizace

Ostatní

Rok uplatnění
2013
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Údaje specifické pro druh výsledku

Název knihy nebo sborníku
Grundlagen einer sprachwissenschaftlichen Quellenkunde
ISBN
978-3-8233-6836-6
Počet stran výsledku
18
Strana od-do
299-316
Počet stran knihy
382
Název nakladatele
Narr Verlag
Místo vydání
Tübingen
Kód UT WoS kapitoly
—

Podobné výsledky(10)

Parallel forms on -ation and -ierung from corpus linguistic perspective - what shows the significance test?A Compendium of Corpus Lingustics Od korpusu jako otevřeného zdroje pro bádání ke komerčním produktům

Co hledáte?

Rychlé hledání

Chytré vyhledávání

The role of small corpora in linguistic research

Identifikátory výsledku

Alternativní jazyky

Klasifikace

Návaznosti výsledku

Ostatní

Údaje specifické pro druh výsledku

Podobné výsledky(10)

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Popis výsledku

Identifikátory výsledku

Identifikátory výsledku

Alternativní jazyky

Alternativní jazyky

Klasifikace

Klasifikace

Návaznosti výsledku

Návaznosti výsledku

Ostatní

Ostatní

Údaje specifické pro druh výsledku

Údaje specifické pro druh výsledku

Podobné výsledky(10)