OpenCL Kernel Fusion for GPU, Xeon Phi and CPU
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216224%3A14330%2F15%3A00083464" target="_blank" >RIV/00216224:14330/15:00083464 - isvavai.cz</a>
Result on the web
<a href="http://dx.doi.org/10.1109/SBAC-PAD.2015.29" target="_blank" >http://dx.doi.org/10.1109/SBAC-PAD.2015.29</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1109/SBAC-PAD.2015.29" target="_blank" >10.1109/SBAC-PAD.2015.29</a>
Alternative languages
Result language
angličtina
Original language name
OpenCL Kernel Fusion for GPU, Xeon Phi and CPU
Original language description
Kernel fusion is an optimization method, in which the code from several kernels is composed to create a new, fused kernel. It can push the performance of kernels beyond limits given for their isolated, unfused form. In this paper, we introduce a classification of different types of kernel fusion for both data dependent and data independent kernels. We study kernel fusion on three types of OpenCL devices: GPU, Xeon Phi and CPU. Those hardware platforms have quite different properties, thus, kernel fusionoften affects performance in quite different ways. We analyze the impact of kernel fusion on those hardware platforms and show how it can be used to improve performance. Based on our study we also introduce a basic transformation method for generating fused kernels, which has good potential to be automatized.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
IN - Informatics
OECD FORD branch
—
Result continuities
Project
<a href="/en/project/EE2.3.30.0037" target="_blank" >EE2.3.30.0037: Employment of Best Young Scientists for International Cooperation Empowerment</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2015
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Proceedings of IEEE International Symposium on Computer Architecture and High Performance Computing
ISBN
—
ISSN
1550-6533
e-ISSN
—
Number of pages
8
Pages from-to
98-105
Publisher name
IEEE
Place of publication
Florianópolis
Event location
Florianópolis, Brazil
Event date
Jan 1, 2015
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—