All

What are you looking for?

All
Projects
Results
Organizations

Quick search

  • Projects supported by TA ČR
  • Excellent projects
  • Projects with the highest public support
  • Current projects

Smart search

  • That is how I find a specific +word
  • That is how I leave the -word out of the results
  • “That is how I can find the whole phrase”

ScaleText: The Design of a Scalable, Adaptable and User-Friendly Document System for Similarity Searches : Digging for Nuggets of Wisdom in Text

The result's identifiers

  • Result code in IS VaVaI

    <a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216224%3A14330%2F16%3A00087632" target="_blank" >RIV/00216224:14330/16:00087632 - isvavai.cz</a>

  • Alternative codes found

    RIV/03892620:_____/16:00000001

  • Result on the web

    <a href="http://www.fi.muni.cz/usr/sojka/papers/rygl-sojka-ruzicka-rehurek-raslan2016.pdf" target="_blank" >http://www.fi.muni.cz/usr/sojka/papers/rygl-sojka-ruzicka-rehurek-raslan2016.pdf</a>

  • DOI - Digital Object Identifier

Alternative languages

  • Result language

    angličtina

  • Original language name

    ScaleText: The Design of a Scalable, Adaptable and User-Friendly Document System for Similarity Searches : Digging for Nuggets of Wisdom in Text

  • Original language description

    This paper describes the design of a new ScaleText system aimed at scalable semantic indexing of heterogeneous textual corpora. We discuss the design decisions that lead to a modular system architecture for indexing and searching using semantic vectors of document segments – nuggets of wisdom. The prototype system implementation is evaluated by applying Latent Semantic Indexing (LSI) on the Enron corpus. And the Bpref measure is used to automate comparing the performance of different algorithms and system configurations.

  • Czech name

  • Czech description

Classification

  • Type

    D - Article in proceedings

  • CEP classification

    IN - Informatics

  • OECD FORD branch

Result continuities

  • Project

    <a href="/en/project/TD03000295" target="_blank" >TD03000295: Intelligent software for semantic text search</a><br>

  • Continuities

    P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)

Others

  • Publication year

    2016

  • Confidentiality

    S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Data specific for result type

  • Article name in the collection

    Proceedings of the Tenth Workshop on Recent Advances in Slavonic Natural Language Processing, RASLAN 2016

  • ISBN

    9788026310952

  • ISSN

    2336-4289

  • e-ISSN

  • Number of pages

    9

  • Pages from-to

    79-87

  • Publisher name

    Tribun EU

  • Place of publication

    Brno

  • Event location

    Karlova Studánka

  • Event date

    Dec 2, 2016

  • Type of event by nationality

    EUR - Evropská akce

  • UT code for WoS article