Repository logo

Infoscience

  • English
  • French
Log In
Logo EPFL, École polytechnique fédérale de Lausanne

Infoscience

  • English
  • French
Log In
  1. Home
  2. Academic and Research Output
  3. EPFL thesis
  4. Compilation and Code Optimization for Data Analytics
 
doctoral thesis

Compilation and Code Optimization for Data Analytics

Shaikhha, Amir  
2018

The trade-offs between the use of modern high-level and low-level programming languages in constructing complex software artifacts are well known. High-level languages allow for greater programmer productivity: abstraction and genericity allow for the same functionality to be implemented with significantly less code compared to low-level languages. Modularity, object-orientation, functional programming, and powerful type systems allow programmers not only to create clean abstractions and protect them from leaking, but also to define code units that are reusable and easily composable, and software architectures that are adaptable and extensible. The abstraction, succinctness, and modularity of high-level code help to avoid software bugs and facilitate debugging and maintenance.

The use of high-level languages comes at a performance cost: increased indirection due to abstraction, virtualization, and interpretation, and superfluous work, particularly in the form of tempory memory allocation and deallocation to support objects and encapsulation.
As a result of this, the cost of high-level languages for performance-critical systems may seem prohibitive.

The vision of abstraction without regret argues that it is possible to use high-level languages for building performance-critical systems that allow for both productivity and high performance, instead of trading off the former for the latter. In this thesis, we realize this vision for building different types of data analytics systems. Our means of achieving this is by employing compilation. The goal is to compile away expensive language features -- to compile high-level code down to efficient low-level code.

  • Files
  • Details
  • Metrics
Type
doctoral thesis
DOI
10.5075/epfl-thesis-8762
Author(s)
Shaikhha, Amir  
Advisors
Koch, Christoph  
Jury

Prof. Viktor Kuncak (président) ; Prof. Christoph Koch (directeur de thèse) ; Prof. Martin Odersky, Prof. Val Tannen, Dr Dimitrios Vytiniotis (rapporteurs)

Date Issued

2018

Publisher

EPFL

Publisher place

Lausanne

Public defense year

2018-08-31

Thesis number

8762

Total of pages

328

Subjects

High-level programming languages

•

Domain-specific languages

•

Program synthesis

•

Query processing

•

Generative programming

•

Optimizing compilers

•

Abstraction without regret

•

Database optimization

•

Data analytics

EPFL units
DATA  
Faculty
IC  
School
IINFCOM  
Doctoral School
EDIC  
Available on Infoscience
August 29, 2018
Use this identifier to reference this record
https://infoscience.epfl.ch/handle/20.500.14299/148017
Logo EPFL, École polytechnique fédérale de Lausanne
  • Contact
  • infoscience@epfl.ch

  • Follow us on Facebook
  • Follow us on Instagram
  • Follow us on LinkedIn
  • Follow us on X
  • Follow us on Youtube
AccessibilityLegal noticePrivacy policyCookie settingsEnd User AgreementGet helpFeedback

Infoscience is a service managed and provided by the Library and IT Services of EPFL. © EPFL, tous droits réservés