Advanced SPARQL querying in small molecule databases
Result identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F61388963%3A_____%2F16%3A00461690" target="_blank" >RIV/61388963:_____/16:00461690 - isvavai.cz</a>
Result on the web
<a href="http://jcheminf.springeropen.com/articles/10.1186/s13321-016-0144-4" target="_blank" >http://jcheminf.springeropen.com/articles/10.1186/s13321-016-0144-4</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1186/s13321-016-0144-4" target="_blank" >10.1186/s13321-016-0144-4</a>
Alternative languages
Result language
English
Title in the original language
Advanced SPARQL querying in small molecule databases
Result description in the original language
In recent years, the Resource Description Framework (RDF) and the SPARQL query language have become more widely used in the area of cheminformatics and bioinformatics databases. These technologies allow better interoperability of various data sources and powerful searching facilities. However, we identified several deficiencies that make usage of such RDF databases restrictive or challenging for common users. We extended a SPARQL engine to be able to use special procedures inside SPARQL queries. This allows the user to work with data that cannot be simply precomputed and thus cannot be directly stored in the database. We designed an algorithm that checks a query against data ontology to identify possible user errors. This greatly improves query debugging. We also introduced an approach to visualize retrieved data in a user-friendly way, based on templates describing visualizations of resource classes. To integrate all of our approaches, we developed a simple web application. Our system was implemented successfully, and we demonstrated its usability on the ChEBI database transformed into RDF form. To demonstrate procedure call functions, we employed compound similarity searching based on OrChem. The application is publicly available at https://bioinfo.uochb.cas.cz/projects/chemRDF.
Title in English
Advanced SPARQL querying in small molecule databases
Result description in English
In recent years, the Resource Description Framework (RDF) and the SPARQL query language have become more widely used in the area of cheminformatics and bioinformatics databases. These technologies allow better interoperability of various data sources and powerful searching facilities. However, we identified several deficiencies that make usage of such RDF databases restrictive or challenging for common users. We extended a SPARQL engine to be able to use special procedures inside SPARQL queries. This allows the user to work with data that cannot be simply precomputed and thus cannot be directly stored in the database. We designed an algorithm that checks a query against data ontology to identify possible user errors. This greatly improves query debugging. We also introduced an approach to visualize retrieved data in a user-friendly way, based on templates describing visualizations of resource classes. To integrate all of our approaches, we developed a simple web application. Our system was implemented successfully, and we demonstrated its usability on the ChEBI database transformed into RDF form. To demonstrate procedure call functions, we employed compound similarity searching based on OrChem. The application is publicly available at https://bioinfo.uochb.cas.cz/projects/chemRDF.
Classification
Type
J<sub>x</sub> - Unclassified - Article in a specialist periodical (Jimp, Jsc and Jost)
CEP field
CF - Physical chemistry and theoretical chemistry
OECD FORD field
—
Result linkages
Project
<a href="/cs/project/LM2015047" target="_blank" >LM2015047: Czech National Infrastructure for Biological Data</a><br>
Linkages
I - Institutional support for the long-term conceptual development of a research organization
Other
Year of implementation
2016
Data confidentiality code
S - Complete and true data on the project are not subject to protection under special legal regulations
Data specific to the result type
Periodical name
Journal of Cheminformatics
ISSN
1758-2946
e-ISSN
—
Periodical volume
8
Issue of the periodical within the volume
Jun 6
Publisher country
GB - United Kingdom of Great Britain and Northern Ireland
Number of pages
14
Pages from-to
—
UT WoS code of the article
000377064900001
EID of the result in the Scopus database
2-s2.0-84973379995