Towards Dynamic SQL Compilation in Apache Spark (MoreVMs'20 - : Workshop on Modern Language Runtimes, Ecosystems, and VMs)

Mon 23 - Thu 26 March 2020 Porto, Portugal

Who

Filippo Schiavio, Daniele Bonetta, Walter Binder

Track

MoreVMs'20

Time Zone

The program is currently displayed in (GMT) Belfast.

Use conference time zone: (GMT) BelfastSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Tue 24 Mar 2020 17:00 - 17:30 at W1 - Dynamic Runtime Optimizations

Abstract

Big-data systems have gained significant momentum, and Apache Spark is becoming a de-facto standard for modern data analytics. Spark relies on code generation to optimize the execution performance of SQL queries on a variety of data sources. Despite its already efficient runtime, Spark’s code generation suffers from significant runtime overheads related to data de-serialization during query execution. Such performance penalty can be significant, especially when applications operate on human-readable data formats such as CSV or JSON.

Filippo Schiavio

Università della Svizzera italiana

Italy

Daniele Bonetta

Oracle Labs

United States

Walter Binder

University of Lugano, Switzerland

Switzerland