Filters
Introducing the CURLICAT Corpora: Seven-language Domain Specific Annotated Corpora from Curated Sources
, which aims to collect and deeply annotate a set of large corpora from selected domains. The CURLICAT corpus includes 7 monolingual corpora (Bulgarian, Croatian respective national corpora. These corpora ...
Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
- 2022 •
- D •
- Link
Rok uplatnění
D - Stať ve sborníku
Výsledek na webu
A multi-lingual and cross-domain analysis of features for text simplification
, Spanish, and Italian text simplification corpora. Our multi-lingual and multi-domain simplification is different per corpora, language, and domain. For example, the relevance domains, and 14 features wit...
Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
- 2020 •
- O •
- Link
Rok uplatnění
O - Ostatní výsledky
Výsledek na webu
Translating the Language of Aviation. The Development and Detailed analysis of the English-Bengali Aviation Corpus for Machine translation
and common domains exhausted and things of the past, Modern fields of research corpora domains lie anywhere between medicines to aero-science. The Work becomes more touch such as Aeronautics and Aviation. With corpora<...
Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
- 2023 •
- Jost •
- Link
Rok uplatnění
Jost - Ostatní články v recenzovaných periodicích
Výsledek na webu
FJWU participation for the WMT20 Biomedical Translation Task
the effects of adding in-domain corpora extracted from various out-of-domain sources. Systems were built for French to English using in-domain corpora through fine tuning on effect of domain adap...
Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
- 2020 •
- O •
- Link
Rok uplatnění
O - Ostatní výsledky
Výsledek na webu
Finding Terms in Corpora for Many Languages with the Sketch Engine
Term candidates for a domain, in a language, can be found by taking a corpus for the domain, and a refer- ence corpus for the language identifying the grammatical shape of a term in the language tokenising, lemmatising and POS-taggi...
IN - Informatika
- 2014 •
- D •
- Link
Rok uplatnění
D - Stať ve sborníku
Výsledek na webu
WebBootCaT: instant domain-specific corpora to support human translators
We present a web service to aid translators by quickly producing corpora for specialist areas, in any of a range of languages, from the web. The underlying BootCaT query tool, for further exploration. Reference corpora are used to i...
IN - Informatika
- 2006 •
- D
Rok uplatnění
D - Stať ve sborníku
Power Networks Dialogs - Enhancing Domain-Specific Text Processing Techniques and Resources
of the EPN resources. The new data represent one of the largest domain specific corporaIn this paper, we describe the process of development of the analytical approaches adapted for the work with technical texts specialized at the ...
IN - Informatika
- 2008 •
- D
Rok uplatnění
D - Stať ve sborníku
Semantic modelling in corpora and dialogue systems
The paper deals with semantic modeling, building semantic roles and semantic hierarchies in specific domains within development of computerized dialogue system. The used abstract levels and parser functionality are mentioned....
JD - Využití počítačů, robotika a její aplikace
- 2004 •
- D
Rok uplatnění
D - Stať ve sborníku
Using Language Models to Improve Rule-based Linguistic Annotation of Modern Historical Japanese Corpora
Annotation of unlabeled textual corpora with linguistic metadata texts and current methods lack mechanisms that enable helpful evaluations by domain, this paper proposes the use of unsupervised domain adaptation methods to ...
Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
- 2022 •
- D •
- Link
Rok uplatnění
D - Stať ve sborníku
Výsledek na webu
Design and Development of Speech Corpora for Air Traffic Control Training
The paper describes the process of creation of domain-specific speech corpora containing air traffic control (ATC) communication prompts. Since the ATC domain and annotate data from this specific domain. Actually, ...
Automation and control systems
- 2019 •
- D •
- Link
Rok uplatnění
D - Stať ve sborníku
Výsledek na webu
- 1 - 10 out of 20 072