Towards Dynamic SQL Compilation in Apache Spark
Big-data systems have gained significant momentum, and Apache Spark is becoming a de-facto standard for modern data analytics. Spark relies on code generation to optimize the execution performance of SQL queries on a variety of data sources. Despite its already efficient runtime, Spark’s code generation suffers from significant runtime overheads related to data de-serialization during query execution. Such performance penalty can be significant, especially when applications operate on human-readable data formats such as CSV or JSON.
Tue 24 MarDisplayed time zone: Belfast change
16:00 - 17:30
|Running Parallel Bytecode Interpreters on Heterogeneous Hardware|
Juan Fumero University of Manchester, UK, Athanasios Stratikopoulos The University of Manchester, Christos Kotselidis KTM Innovation / The University of ManchesterPre-print
|Toward Presizing and Pretransitioning Strategies for GraalPython|
|Towards Dynamic SQL Compilation in Apache Spark|