Benefit of Proper Language Processing for Czech Speech Retrieval in the CL-SR Task at CLEF 2006

The result's identifiers

Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F49777513%3A23520%2F07%3A00000303" target="_blank" >RIV/49777513:23520/07:00000303 - isvavai.cz</a>
Result on the web
—
DOI - Digital Object Identifier
—

Alternative languages

Result language
angličtina
Original language name
Benefit of Proper Language Processing for Czech Speech Retrieval in the CL-SR Task at CLEF 2006
Original language description
The paper describes the system built by the team from the University of West Bohemia for participation in the CLEF 2006 CL-SR track. We have decided to concentrate only on the monolingual searching in the Czech test collection and investigate the effectof proper language processing on the retrieval performance. We have employed the Czech morphological analyser and tagger for that purposes. For the actual search system, we have used the classical tf.idf approach with blind relevance feedback as implemented in the Lemur toolkit. The results indicate that a suitable linguistic preprocessing is indeed crucial for the Czech IR performance.
Czech name
Přínos vhodného jazykového předzpracování pro vyhledávání v mluvené češtině v úloze CL-SR na CLEF 2006
Czech description
Článek popisuje systém vytvořený týmem Západočeské univerzity pro účely participace v kampani CLEF 2006 CL-SR track. Rozhodli jsme se soustředit pouze na prohledávání české testovací kolekce a prozkoumání přínosu vhodného jazykového předzpracování pro úspěšnost vyhledávání. Pro účely lingvistického předzpracování dat jsme použili morfologický analyzátor a tagger. Pro vlastní vyhledávání jsme využili klasický tf.idf přístup se slepou zpětnou vazbou tak, jak je implementován v systému Lemur. Výsledky naznačují, že vhodné lingvistické předzpracování je pro úspěšné vyhledávání v mluvené češtině vskutku klíčové.

Classification

Type
D - Article in proceedings
CEP classification
JD - Use of computers, robotics and its application
OECD FORD branch
—

Result continuities

Project
Result was created during the realization of more than one project. More information in the Projects tab.
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)

Others

Publication year
2007
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Data specific for result type

Article name in the collection
Lecture Notes in Computer Science
ISBN
978-3-540-74998-1
ISSN
—
e-ISSN
—
Number of pages
7
Pages from-to
—
Publisher name
Springer
Place of publication
Berlin
Event location
Alicante
Event date
Sep 20, 2006
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
000250568000095

Similar results(10)

The University of West Bohemia at CLEF 2006, the CL-SR track No Free Lunch in Factored Phrase-Based Machine Translation End-to-End Open Vocabulary Keyword Search With Multilingual Neural Representations

What are you looking for?

Quick search

Smart search

Benefit of Proper Language Processing for Czech Speech Retrieval in the CL-SR Task at CLEF 2006

The result's identifiers

Alternative languages

Classification

Result continuities

Others

Data specific for result type

Similar results(10)

What are you looking for?

Quick search

Smart search

Result description

The result's identifiers

The result's identifiers

Alternative languages

Alternative languages

Classification

Classification

Result continuities

Result continuities

Others

Others

Data specific for result type

Data specific for result type

Similar results(10)