Fast syntactic searching in very large corpora for many languages
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216224%3A14330%2F10%3A00045408" target="_blank" >RIV/00216224:14330/10:00045408 - isvavai.cz</a>
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
Fast syntactic searching in very large corpora for many languages
Original language description
For many linguistic investigations, the first step is to find examples. In the 21st century, they should all be found, not invented. Thus linguists need flexible tools for finding even quite rare phenomena. To support linguists well, they need to be fasteven where corpora are very large and queries are complex. We present extensions to the CQL ("Corpus Query Language") for intuitive creation of syntactically rich queries, and demonstrate that they can be computed quickly within our tool even on multi-billion word corpora.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
IN - Informatics
OECD FORD branch
—
Result continuities
Project
Result was created during the realization of more than one project. More information in the Projects tab.
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)<br>S - Specificky vyzkum na vysokych skolach<br>R - Projekt Ramcoveho programu EK
Others
Publication year
2010
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
PACLIC 24 Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation
ISBN
978-4-905166-00-9
ISSN
—
e-ISSN
—
Number of pages
7
Pages from-to
—
Publisher name
Waseda University
Place of publication
Tokyo
Event location
Sendai, Japonsko
Event date
Nov 4, 2010
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—