Configurable Open-source Data Structure for Distributed Conforming Unstructured Homogeneous Meshes with GPU Support
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F68407700%3A21340%2F22%3A00359776" target="_blank" >RIV/68407700:21340/22:00359776 - isvavai.cz</a>
Result on the web
<a href="https://doi.org/10.1145/3536164" target="_blank" >https://doi.org/10.1145/3536164</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1145/3536164" target="_blank" >10.1145/3536164</a>
Alternative languages
Result language
angličtina
Original language name
Configurable Open-source Data Structure for Distributed Conforming Unstructured Homogeneous Meshes with GPU Support
Original language description
A general multi-purpose data structure for an efficient representation of conforming unstructured homogeneous meshes for scientific computations on CPU and GPU-based systems is presented. The data structure is provided as open-source software as part of the TNL library (https://tnl-project.org/). The abstract representation supports almost any cell shape and common 2D quadrilateral, 3D hexahedron and arbitrarily dimensional simplex shapes are currently built into the library. The implementation is highly configurable via templates of the C++ language, which allows avoiding the storage of unnecessary dynamic data. The internal memory layout is based on state-of-the-art sparse matrix storage formats, which are optimized for different hardware architectures in order to provide high-performance computations. The proposed data structure is also suitable for meshes decomposed into several subdomains and distributed computing using the Message Passing Interface (MPI). The efficiency of the implemented data structure on CPU and GPU hardware architectures is demonstrated on several benchmark problems and a comparison with another library. Its applicability to advanced numerical methods is demonstrated with an example problem of two-phase flow in porous media using a numerical scheme based on the mixed-hybrid finite element method (MHFEM). We show GPU speed-ups that rise above 20 in 2D and 50 in 3D when compared to sequential CPU computations, and above 2 in 2D and 9 in 3D when compared to 12-threaded CPU computations.
Czech name
—
Czech description
—
Classification
Type
J<sub>imp</sub> - Article in a specialist periodical, which is included in the Web of Science database
CEP classification
—
OECD FORD branch
10102 - Applied mathematics
Result continuities
Project
Result was created during the realization of more than one project. More information in the Projects tab.
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2022
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Name of the periodical
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE
ISSN
0098-3500
e-ISSN
1557-7295
Volume of the periodical
48
Issue of the periodical within the volume
3
Country of publishing house
US - UNITED STATES
Number of pages
30
Pages from-to
—
UT code for WoS article
000865883900006
EID of the result in the Scopus database
2-s2.0-85142412558