An Executable Sequential Specification for Spark Aggregation
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F00216305%3A26230%2F17%3APU127271" target="_blank" >RIV/00216305:26230/17:PU127271 - isvavai.cz</a>
Result on the web
<a href="http://www.fit.vutbr.cz/research/pubs/all.php?id=11330" target="_blank" >http://www.fit.vutbr.cz/research/pubs/all.php?id=11330</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1007/978-3-319-59647-1_31" target="_blank" >10.1007/978-3-319-59647-1_31</a>
Alternative languages
Result language
angličtina
Original language name
An Executable Sequential Specification for Spark Aggregation
Original language description
Spark is a new promising platform for scalable data-parallel computation. It provides several high-level application programming interfaces (APIs) to perform parallel data aggregation. Since execution of parallel aggregation in Spark is inherently non-deterministic, a natural requirement for Spark programs is to give the same result for any execution on the same data set. We present PureSpark, an executable formal Haskell specification for Spark aggregate combinators. Our specification allows us to deduce the precise condition for deterministic outcomes from Spark aggregation. We report case studies analyzing deterministic outcomes and correctness of Spark programs.
Czech name
—
Czech description
—
Classification
Type
D - Article in proceedings
CEP classification
—
OECD FORD branch
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Result continuities
Project
Result was created during the realization of more than one project. More information in the Projects tab.
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2017
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Article name in the collection
Proceedings of NETYS'17
ISBN
—
ISSN
0302-9743
e-ISSN
—
Number of pages
15
Pages from-to
421-438
Publisher name
Springer Verlag
Place of publication
Heidelberg
Event location
Marrakech
Event date
May 17, 2017
Type of event by nationality
WRD - Celosvětová akce
UT code for WoS article
—