Coping with unruly language: non-standard usage in a corpus
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11210%2F18%3A10385135" target="_blank" >RIV/00216208:11210/18:10385135 - isvavai.cz</a>
Result on the web
<a href="http://vg07.met.vgwort.de/na/612d2e2820e34c5cae7564131e335a91?l=https://heiup.uni-heidelberg.de/reader/download/361/361-69-81153-2-10-20180606.pdf" target="_blank" >http://vg07.met.vgwort.de/na/612d2e2820e34c5cae7564131e335a91?l=https://heiup.uni-heidelberg.de/reader/download/361/361-69-81153-2-10-20180606.pdf</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.17885/heiup.361.509" target="_blank" >10.17885/heiup.361.509</a>
Alternative languages
Result language
angličtina
Original language name
Coping with unruly language: non-standard usage in a corpus
Original language description
A language as used in real situations may differ substantially from its standard form. Before the entire range of NLP methods and tools can be applied to non-canonical variants of a language, appropriate categories for the analysis of deviant forms and constructions are needed, together with texts annotated by these categories. A discussion of non-standard language is followed by two case studies. The first study proposes a taxonomy of morphosyntactic categories as an attempt to analyze non-standard forms in non-native learners' Czech. The second study focuses on the role of a rule-based grammar. and lexicon as tools for the detection and diagnostics of non-standard words and constructions in the process of building and using a parsebank.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
60203 - Linguistics
Result continuities
Project
<a href="/en/project/GA16-10185S" target="_blank" >GA16-10185S: Non-native Czech from the Theoretical and Computational Perspective</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2018
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Grammar and Corpora 2016
ISBN
978-3-946054-84-9
ISSN
—
e-ISSN
neuvedeno
Number of pages
17
Pages from-to
271-287
Publisher name
Heidelberg University Publishing
Place of publication
Heidelberg
Event location
Mannheim
Event date
Nov 8, 2016
Type of event by nationality
EUR - Evropská akce
UT code for WoS article
—