Automatic Fusions of CUDA-GPU Kernels for Parallel Map
The result's identifiers
Result code in IS VaVaI
RIV/00216224:14330/11:00054287 - isvavai.cz (https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216224%3A14330%2F11%3A00054287)
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
English
Original language name
Automatic Fusions of CUDA-GPU Kernels for Parallel Map
Original language description
When implementing a function mapping on a contemporary GPU, several contradictory performance factors affecting the distribution of computation into GPU kernels have to be balanced. A decomposition-fusion scheme proposes decomposing the computational problem so that it is solved by several simple functions implemented as standalone kernels, and later fusing some of these functions into more complex kernels to improve memory locality. In this paper, a prototype of a source-to-source compiler automating the fusion phase is presented, and the impact of the fusions generated by the compiler, as well as the compiler's efficiency, is experimentally evaluated.
Czech name
—
Czech description
—
Classification
Type
Jx - Unclassified - Peer-reviewed scientific article (Jimp, Jsc and Jost)
CEP classification
IN - Informatics
OECD FORD branch
—
Result continuities
Project
—
Continuities
Z - Research plan (with a link to CEZ)
S - Specific research at universities
Others
Publication year
2011
Confidentiality
S - Complete and truthful data on the project are not subject to protection under special legal regulations
Data specific for result type
Name of the periodical
ACM SIGARCH Computer Architecture News
ISSN
0163-5964
e-ISSN
—
Volume of the periodical
39
Issue of the periodical within the volume
4
Country of publishing house
US - UNITED STATES
Number of pages
2
Pages from-to
98-99
UT code for WoS article
—
EID of the result in the Scopus database
—