Compiler 2.0

Saman Amarasinghe

doi:10.1145/3372799.3397167

Abstract

Modern compilers are still built using technology that existed decades ago. These include basic algorithms and techniques for lexing, parsing, data-flow analysis, data dependence analysis, vectorization, register allocation, instruction selection, and instruction scheduling. It is high time that we modernize our compiler toolchain. In this talk, I will show the path to the modernization of one important compiler technique -- vectorization. Vectorization was first introduced in the era of Cray vector processors during the 1980's. In modernizing vectorization, I will first show how to use new techniques that better target modern hardware. While vector supercomputers need large vectors, which are only available by parallelizing loops, modern SIMD instructions efficiently work on short vectors. Thus, in 2000, we introduced Superword Level Parallelism (SLP) based vectorization. SLP finds short vector instructions within basic blocks, and by loop unrolling we can convert vector parallelism to SLP. Next, I will show how we can take advantage of the power of modern computers for compilation, by using more accurate but expensive techniques to improve SLP vectorization. Due to the hardware resource constraints of the era, like many other compiler optimizations, SLP implementation was a greedy algorithm. In 2018, we introduced goSLP, which uses integer linear programming to find an optimal instruction packing strategy and achieves 7.58% geomean performance improvement over the LLVM's SLP implementation on SPEC2017fp C/C++ programs. Finally, I will show how to truly modernize a compiler by automatically learning the necessary components of the compiler with Ithemal and Vemal. The optimality of goSLP is under LLVM's simple per instruction additive cost model that fits within the Integer programming framework. However, the actual cost of execution in a modern out-of-order, pipelined, superscalar processor is much more complex. Manually building such cost models as well as manually developing compiler optimizations is costly, tedious, error-prone and is hard to keep up with the architectural changes. Ithemal is the first learnt cost model for predicting the throughput of x86 basic blocks. It not only significantly outperforms (more than halves the error) state-of-the-art analytical hand-written tools like llvm-mca, but also is learnt from data requiring minimal human effort. Vemal is a learnt policy for end-to-end vectorization as opposed to tuning heuristics, which outperforms LLVM's SLP vectorizer. These data-driven techniques can help achieve state-of-the-art results while also reducing the development and maintenance burden of the compiler developer.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Compiler 2.0

Abstract

Talk to us

Similar Papers

Lead the way for us

Publication Date: Jun 16, 2020
Citations: 1	License type: cc-by-nc

Similar Papers

Fast, frequency‐based, integrated register allocation and instruction scheduling
Ioana Cutcutache ... Weng‐Fai Wong
Software: Practice and Experience | VOL. 38
Ioana Cutcutache, et. al.Ioana Cutcutache ... Weng‐Fai Wong
06 Nov 2007
Software: Practice and Experience | VOL. 38

Transformations and efficient parallel execution of loops with dependencies
...
-
, et. al. ...
01 Jan 1993
01 Jan 1993

Register allocation and promotion through combined instruction scheduling and loop unrolling
Łukasz Domagała ... Fabrice Rastello
-
Łukasz Domagała, et. al.Łukasz Domagała ... Fabrice Rastello
17 Mar 2016
17 Mar 2016

Experiences with Cooperating Register Allocation and Instruction Scheduling
Cindy Norris ... Lori L Pollock
International Journal of Parallel Programming | VOL. 26
Cindy Norris, et. al.Cindy Norris ... Lori L Pollock
01 Jan 1998
International Journal of Parallel Programming | VOL. 26

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Compiler 2.0

Abstract

Talk to us

Similar Papers