ELIXIR-CZ: Capacity building
Project goals
The project plans to build and expand the computing and storage infrastructure in the Czech Republic for the storage, processing and analysis of data from the field of life sciences. It also directly supports the development of selected tools and databases that are very beneficial to the scientific community and which enrich the infrastructure with advanced and highly sophisticated services. The project strengthens the current computing infrastructure by acquiring and installing the equivalent of 200 high-density computing nodes (2-4 CPUs, approx. 256 GB RAM) in a cluster three, adding about 2 PB storage capacity to the ELIXIR-CZ community, and extending the software portfolio by restoring existing and adding new licensing of relevant commercial software packages. The three research programs in the project will increase the capabilities of the ELIXIR-CZ infrastructure by using (1) the new functionality of RepeatExplorer (a bioinformatic tool for characterizing repetitive elements of the new generation of sequential data (NGS), which will be enhanced by the ability to process long sequential readings produced by modern sequencers) 2) Portal Accessible Dolbico Platform for Analysis of Structure-Functional Nucleic Acid Relationships and (3) Enhancing the Quality of Biomacromolecular Structures in Protein Database (PDB) by developing ligand validation tools (MotveValidator and ValidatorDB) and for automatic annotation of the secondary protein structure. All these activities will strengthen the ELIXIR-CZ infrastructure and increase its prestige in the international environment. A detailed description of the objectives is contained in the Feasibility Study in chapters 3.4, 4 and 5.
Keywords
Public support
Provider
Ministry of Education, Youth and Sports
Programme
—
Call for proposals
—
Main participants
Masarykova univerzita / Ústav výpočetní techniky
Contest type
OP - EU Operational Programme
Contract ID
16_013/0001777_01
Alternative language
Project name in Czech
ELIXIR-CZ: Budování kapacit
Annotation in Czech
Projekt plánuje v ČR vybudovat a rozšířit výpočetní a úložnou infrastrukturu pro ukládání, zpracování a analýzu dat z oblasti věd o živé přírodě. Přímo podporuje rovněž rozvoj vybraných nástrojů a databází, které jsou velmi přínosné pro vědeckou komunitu a které obohacují infrastrukturu o pokročilé a vysoce sofistikované služby. Projekt posílí současnou výpočetní infrastrukturu pořízením a instalací ekvivalentu 200 výpočetních uzlů s vysokou hustotou (2-4 CPU, cca 256 GB RAM) ve trojici clusterů, přidá cca 2 PB úložné kapacity pro komunitu ELIXIR-CZ a rozšíří softwarové portfolio obnovením stávajících a přidáním nových licencí relevantních komerčních softwarových balíků. Tři výzkumné programy v projektu zvýší možnosti infrastruktury ELIXIR-CZ pomocí (1) nové funkcionality nástroje RepeatExplorer (bioinformatický nástroj pro charakterizaci repetitivních prvků sekvenačních dat nové generace (NGS), který bude rozšířen o schopnost zpracovávat dlouhá sekvenanční čtení, produkované moderními sekvenátory), (2) portálově přístupnou platformu Dolbico pro analýzu strukturně-funkčních vztahů nukleových kyselin a (3) zvýšením kvality biomakromolekulárních struktur v proteinové databance (PDB) vývojem nástrojů pro validaci ligandů (MotveValidator a ValidatorDB) a pro automatickou anotaci sekundární struktury proteinů. Všechny tyto aktivity posílí infrastrukturu ELIXIR-CZ a zvýší její prestiž v mezinárodním prostředí. Detailní popis cílů je obsažen ve Studii proveditelnosti v kapitolách 3.4, 4 a 5.
Scientific branches
R&D category
IF - RDI infrastructure
CEP classification - main branch
EI - Biotechnology and bionics
CEP - secondary branch
IN - Informatics
CEP - another secondary branch
—
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
20801 - Environmental biotechnology
20802 - Bioremediation, diagnostic biotechnologies (DNA chips and biosensing devices) in environmental management
20803 - Environmental biotechnology related ethics
20901 - Industrial biotechnology
20902 - Bioprocessing technologies (industrial processes relying on biological agents to drive the process) biocatalysis, fermentation
20903 - Bioproducts (products that are manufactured using biological material as feedstock) biomaterials, bioplastics, biofuels, bioderived bulk and fine chemicals, bio-derived novel materials
30401 - Health-related biotechnology
30402 - Technologies involving the manipulation of cells, tissues, organs or the whole organism (assisted reproduction)
30403 - Technologies involving identifying the functioning of DNA, proteins and enzymes and how they influence the onset of disease and maintenance of well-being (gene-based diagnostics and therapeutic interventions [pharmacogenomics, gene-based therapeutics])
30404 - Biomaterials (as related to medical implants, devices, sensors)
30405 - Medical biotechnology related ethics
40401 - Agricultural biotechnology and food biotechnology
40402 - GM technology (crops and livestock), livestock cloning, marker assisted selection, diagnostics (DNA chips and biosensing devices for the early/accurate detection of diseases) biomass feedstock production technologies, biopharming
40403 - Agricultural biotechnology related ethics
Completed project evaluation
Provider evaluation
V - Vynikající výsledky projektu (s mezinárodním významem atd.)
Project results evaluation
The project helped to build and expand the computing and storage infrastructure for storage, processing and analysis of data in the field of life sciences. The project strengthened the current computing infrastructure by acquiring and installing high-density computing nodes in three clusters, added about 2 PB of storage capacity for the ELIXIR-CZ community and expanded the software portfolio by renewing existing and adding new licenses of relevant commercial software. The possibilities of the ELIXIR-CZ infrastructure were expanded with the new functionality of the RepeatExplorer tool (bioinformatics tool for characterization of repetitive elements of next generation sequence data, which was extended by the ability to process long sequence reads), portal DNATCO for analysis of structural-functional nucleic acid relationships and quality of biomacromolecular structures in protein database. This increased the prestige of ELIXIR-CZ infrastructure in the international environment.
Solution timeline
Realization period - beginning
May 1, 2017
Realization period - end
Apr 30, 2021
Project status
U - Finished project
Latest support payment
Sep 6, 2020
Data delivery to CEP
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data delivery code
CEP22-MSM-EF-U
Data delivery date
Jun 30, 2022
Finance
Total approved costs
67,781 thou. CZK
Public financial support
67,781 thou. CZK
Other public sources
0 thou. CZK
Non public and foreign sources
0 thou. CZK
Basic information
Recognised costs
67 781 CZK thou.
Public support
67 781 CZK thou.
100%
Provider
Ministry of Education, Youth and Sports
CEP
EI - Biotechnology and bionics
Solution period
01. 05. 2017 - 30. 04. 2021