Performance, power consumption and thermal behavioral evaluation of the DGX-2 platform
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F61989100%3A27740%2F20%3A10244990" target="_blank" >RIV/61989100:27740/20:10244990 - isvavai.cz</a>
Result on the web
<a href="http://ebooks.iospress.nl/volumearticle/53970" target="_blank" >http://ebooks.iospress.nl/volumearticle/53970</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.3233/APC200091" target="_blank" >10.3233/APC200091</a>
Alternative languages
Result language
angličtina
Original language name
Performance, power consumption and thermal behavioral evaluation of the DGX-2 platform
Original language description
In this paper, we evaluate the performance, power consumption and its variation and also thermal behavior of the DGX-2 server from Nvidia. We present a development of specialized synthetic benchmarks to measure raw performance of GPUs for single, double, half precision and also Tensor Core units. With these benchmarks, we were able to reach peak performance and verify the specification provided by Nvidia. We achieved 130.79 TFLOPS peak performance in half-precision on Tensor Cores. We also measured the thermal stability of the DGX-2 system. It can hold its peak performance when all 16 GPUs are fully loaded except Tensor Core workload, when thermal throttling occurred with with up to 1% performance penalty. During single-precision workload we observed 23% variation of the power consummation of individual GPUs installed in the system. Finally, we have evaluated the behavior of the Tesla V100-SXM3 chip under the DVFS tuning. Running at optimal frequency, the compute bound workload can save up to 39% energy while the run-time increases by 51%. More importantly, memory bound workload can save up to 31% with 2% throughput penalty and during the communication over NVLink one can save up to 26% energy with no penalty. (C) 2020 The authors and IOS Press.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
<a href="/en/project/LM2018140" target="_blank" >LM2018140: e-Infrastructure CZ</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)<br>S - Specificky vyzkum na vysokych skolach
Others
Publication year
2020
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Advances in Parallel Computing. Volume 36
ISBN
978-1-64368-070-5
ISSN
0927-5452
e-ISSN
1879-808X
Number of pages
10
Pages from-to
614-623
Publisher name
IOS Press
Place of publication
Amsterdam
Event location
Praha
Event date
Sep 10, 2019
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—