Large-scale similarity data management with distributed Metric Index
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216224%3A14330%2F12%3A00057505" target="_blank" >RIV/00216224:14330/12:00057505 - isvavai.cz</a>
Result on the web
<a href="http://dx.doi.org/10.1016/j.ipm.2010.12.004" target="_blank" >http://dx.doi.org/10.1016/j.ipm.2010.12.004</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1016/j.ipm.2010.12.004" target="_blank" >10.1016/j.ipm.2010.12.004</a>
Alternative languages
Result language
angličtina
Original language name
Large-scale similarity data management with distributed Metric Index
Original language description
Metric space is a universal and versatile model of similarity that can be applied in various areas of non-text information retrieval. However, a general, efficient and scalable solution for metric data management is still a resisting research challenge.In this work, we try to make an important step towards such management system that would be able to scale to data collections of billions of objects. We propose a distributed index structure for similarity data management called the Metric Index (M-Index) which can answer queries in precise and approximate manner. This technique can take advantage of any distributed hash table that supports interval queries and utilize it as an underlying index. We have performed numerous experiments to test various settings of the M-Index structure and we have proved its usability by developing a full-featured publicly-available Web application.
Czech name
—
Czech description
—
Classification
Type
J<sub>x</sub> - Unclassified - Peer-reviewed scientific article (Jimp, Jsc and Jost)
CEP classification
IN - Informatics
OECD FORD branch
—
Result continuities
Project
Result was created during the realization of more than one project. More information in the Projects tab.
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2012
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
Information Processing and Management
ISSN
0306-4573
e-ISSN
—
Volume of the periodical
48
Issue of the periodical within the volume
5
Country of publishing house
US - UNITED STATES
Number of pages
18
Pages from-to
855-872
UT code for WoS article
000307682100005
EID of the result in the Scopus database
—