Automatic Acquisition of Semantics-Extraction Patterns
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216305%3A26230%2F06%3APU67261" target="_blank" >RIV/00216305:26230/06:PU67261 - isvavai.cz</a>
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
Automatic Acquisition of Semantics-Extraction Patterns
Original language description
This paper examines the use of parallel and comparable corpora for automatic acquisition of semantics-extraction patterns. It presents a new method of the pattern extraction which takes advantage of parallel texts to "port" text mining solutions from a source language to a target language. It is shown that the technique can help in situations when the extraction procedure is to be applied in a language (languages) with a limited set of available resources, e.g. domain-specific thesauri. The primary motivation of our work lies in a particular multilingual e-learning system. For testing purposes, other applications of the given approach were implemented. They include pattern extraction from general texts (tested on wordnet relations), acquisition of domain-specific patterns from large parallel corpus of legal EU documents, and mining of subjectivity expressions for multilingual opinion extraction system.
Czech name
Automatic Acquisition of Semantics-Extraction Patterns
Czech description
Příspěvek se zabývá použitím paralelních korpusů pro automatickou extrakci lexiko-sémantických vzorů.<br>
Classification
Type
D - Article in proceedings
CEP classification
JC - Computer hardware and software
OECD FORD branch
—
Result continuities
Project
—
Continuities
S - Specificky vyzkum na vysokych skolach
Others
Publication year
2006
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Proceedings of the 5th International Conference on Language Resources and Evaluation
ISBN
2-9517408-2-4
ISSN
—
e-ISSN
—
Number of pages
4
Pages from-to
1-4
Publisher name
European Language Resources Association
Place of publication
Paris
Event location
Genoa
Event date
Jul 11, 2006
Type of event by nationality
EUR - Evropská akce
UT code for WoS article
—