Towards Dynamic SQL Compilation in Apache Spark
Big-data systems have gained significant momentum, and Apache Spark is becoming a de-facto standard for modern data analytics. Spark relies on code generation to optimize the execution performance of SQL queries on a variety of data sources. Despite its already efficient runtime, Spark’s code generation suffers from significant runtime overheads related to data de-serialization during query execution. Such performance penalty can be significant, especially when applications operate on human-readable data formats such as CSV or JSON.
Tue 24 Mar Times are displayed in time zone: (GMT+01:00) Greenwich Mean Time : Belfast change
|16:00 - 16:30|
Juan FumeroUniversity of Manchester, UK, Athanasios StratikopoulosThe University of Manchester, Christos KotselidisKTM Innovation / The University of ManchesterPre-print
|16:30 - 17:00|
|17:00 - 17:30|