Simulations and study of a new scheduling approach for distributed data production
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F61389005%3A_____%2F16%3A00506284" target="_blank" >RIV/61389005:_____/16:00506284 - isvavai.cz</a>
Alternative codes found
RIV/00216224:14330/16:00088810
Result on the web
<a href="http://dx.doi.org/10.1088/1742-6596/762/1/012023" target="_blank" >http://dx.doi.org/10.1088/1742-6596/762/1/012023</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1088/1742-6596/762/1/012023" target="_blank" >10.1088/1742-6596/762/1/012023</a>
Alternative languages
Result language
angličtina
Original language name
Simulations and study of a new scheduling approach for distributed data production
Original language description
Distributed data processing has found its application in many fields of science (High Energy and Nuclear Physics (HENP), astronomy, biology to name only those). We have focused our research on distributed data production, an essential part of computations in HENP. Using our previous experience, we have recently proposed a new scheduling approach for distributed data production which is based on the network flow maximization model. It has a polynomial complexity providing required scalability with respect to the size of computations. Our approach improves the overall data production throughput due to three factors: transfer input files in advance before their processing (allows to decrease I/O latency). Balancing of the network traffic (includes splitting the load between several alternative transfer paths), and transfer files sequentially in a coordinated manner (allows to reduce the influence of possible network bottlenecks). In this contribution, we present the results of our new simulations based on the GridSim framework which is one of the commonly used tools in the field of distributed computations. In these simulations we study the behavior of standard scheduling approaches compared to our recently proposed approach in a realistic environment relying on the data from the STAR and ATLAS experiments and considering the influence of the background traffic. The final goal of the research is to integrate the proposed scheduling approach into the real data production framework. In order to achieve this we are constantly moving our simulations towards real use cases, study scalability of the model and the influence of the scheduling parameters on the quality of the solution.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
10304 - Nuclear physics
Result continuities
Project
<a href="/en/project/LG15001" target="_blank" >LG15001: Participation of the Czech Republic in experiments at Brookhaven National Laboratory, USA</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2016
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Journal of Physics Conference Series
ISBN
—
ISSN
1742-6588
e-ISSN
—
Number of pages
7
Pages from-to
012023
Publisher name
IOP Publishing
Place of publication
Bristol
Event location
Valparaiso
Event date
Jan 18, 2016
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
000439689600023