ScaleText: The Design of a Scalable, Adaptable and User-Friendly Document System for Similarity Searches : Digging for Nuggets of Wisdom in Text
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216224%3A14330%2F16%3A00087632" target="_blank" >RIV/00216224:14330/16:00087632 - isvavai.cz</a>
Alternative codes found
RIV/03892620:_____/16:00000001
Result on the web
<a href="http://www.fi.muni.cz/usr/sojka/papers/rygl-sojka-ruzicka-rehurek-raslan2016.pdf" target="_blank" >http://www.fi.muni.cz/usr/sojka/papers/rygl-sojka-ruzicka-rehurek-raslan2016.pdf</a>
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
ScaleText: The Design of a Scalable, Adaptable and User-Friendly Document System for Similarity Searches : Digging for Nuggets of Wisdom in Text
Original language description
This paper describes the design of a new ScaleText system aimed at scalable semantic indexing of heterogeneous textual corpora. We discuss the design decisions that lead to a modular system architecture for indexing and searching using semantic vectors of document segments – nuggets of wisdom. The prototype system implementation is evaluated by applying Latent Semantic Indexing (LSI) on the Enron corpus. And the Bpref measure is used to automate comparing the performance of different algorithms and system configurations.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
IN - Informatics
OECD FORD branch
—
Result continuities
Project
<a href="/en/project/TD03000295" target="_blank" >TD03000295: Intelligent software for semantic text search</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2016
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Proceedings of the Tenth Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2016
ISBN
9788026310952
ISSN
2336-4289
e-ISSN
—
Number of pages
9
Pages from-to
79-87
Publisher name
Tribun EU
Place of publication
Brno
Event location
Karlova Studánka
Event date
Dec 2, 2016
Type of event by nationality
EUR - Evropská akce
UT code for WoS article
—