On Optimizing the Non-metric Similarity Search in Tandem Mass Spectra by Clustering
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216208%3A11320%2F12%3A10124006" target="_blank" >RIV/00216208:11320/12:10124006 - isvavai.cz</a>
Result on the web
<a href="http://dx.doi.org/10.1007/978-3-642-30191-9_18" target="_blank" >http://dx.doi.org/10.1007/978-3-642-30191-9_18</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/978-3-642-30191-9_18" target="_blank" >10.1007/978-3-642-30191-9_18</a>
Alternative languages
Result language
angličtina
Original language name
On Optimizing the Non-metric Similarity Search in Tandem Mass Spectra by Clustering
Original language description
Tandem mass spectrometry is a well-known technique for identification of protein sequences from an "in vitro" sample. To identify the sequences from spectra captured by a spectrometer, the similarity search in a database of hypothetical mass spectra is often used. For this purpose, a database of known protein sequences is utilized to generate the hypothetical spectra. Since the number of sequences in the databases grows rapidly over the time, several approaches have been proposed to index the databasesof mass spectra. In this paper, we improve an approach based on the non-metric similarity search where the M-tree and the TriGen algorithm are employed for fast and approximative search. We show that preprocessing of mass spectra by clustering speeds upthe identification of sequences more than 100x with respect to the sequential scan of the entire database. Moreover, when the protein candidates are refined by sequential scan in the postprocessing step, the whole approach exhibits precis
Czech name
—
Czech description
—
Classification
Type
J<sub>x</sub> - Unclassified - Peer-reviewed scientific article (Jimp, Jsc and Jost)
CEP classification
IN - Informatics
OECD FORD branch
—
Result continuities
Project
Result was created during the realization of more than one project. More information in the Projects tab.
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)<br>S - Specificky vyzkum na vysokych skolach
Others
Publication year
2012
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
Lecture Notes in Computer Science
ISSN
0302-9743
e-ISSN
—
Volume of the periodical
2012
Issue of the periodical within the volume
7292
Country of publishing house
DE - GERMANY
Number of pages
12
Pages from-to
189-200
UT code for WoS article
—
EID of the result in the Scopus database
—