Xeon Phi Acceleration of Domain Decomposition Iterations via Heterogeneous Active Messages

Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F61989100%3A27120%2F18%3A10239485" target="_blank" >RIV/61989100:27120/18:10239485 - isvavai.cz</a>
Alternative codes found
RIV/61989100:27240/18:10239485 RIV/61989100:27730/18:10239485 RIV/61989100:27740/18:10239485
Result on the web
<a href="https://aip.scitation.org/doi/abs/10.1063/1.5043963" target="_blank" >https://aip.scitation.org/doi/abs/10.1063/1.5043963</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1063/1.5043963" target="_blank" >10.1063/1.5043963</a>

Result language
English
Title in original language
Xeon Phi Acceleration of Domain Decomposition Iterations via Heterogeneous Active Messages
Result description in original language
We present an acceleration strategy for a domain-decomposition iterative solver based on local Schur complements with respect to the skeleton of the computational domain. In finite element tearing and interconnecting (FETI), this results in the repeated application of a large number of dense matrices. To offload such kernels to the Intel(R) Xeon Phi(TM) coprocessors, we use the Heterogeneous Active Messages (HAM) library. A simple load balancing strategy is presented to efficiently utilize both the host CPU and the available coprocessors during individual iterations.

Project
The result was produced within several projects. More information is available in the Projects tab.
Linkages
P - Research and development project financed from public funds (with a link to CEP)
S - Specific research at universities

Year of implementation
2018
Data confidentiality code
S - Complete and truthful data on the project are not subject to protection under special legal regulations
