Ускорение оптимизации программ во время связывания

K Dolgorukova,S Arishin

doi:10.15514/ispras-2016-28(5)-11

Abstract

This paper proposes the two different approaches to speed-up program build: making link-time optimization work in parallel and lightweight optimization approach. The former technique is achieved by scaling LTO system. The latter makes link to work faster because of using summaries to manage some of interprocedural optimization passes instead of full IR code in memory. The problem of horizontal LTO system scaling leads to the problem of partition of the large task to several subtasks that can be performed concurrently. The problem is complicated by the compiler pipeline model: interprocedural optimization passes works consequentially and depends on previous performed ones. That means we can divide just data on which passes works, not passes themselves. We need to divide IR code to sub-independent parts and run LTO on them in parallel. We use program call graph analysis to divide a program to parts. Therefore, our goal is to divide call graph that is one of NP-compete problems. Nevertheless, the choice of the dividing algorithm strongly depends on properties of divided graph. The main goal of our investigation is to find lightweight graph partition algorithm that works efficiently on call graphs of real programs and that does not spoil LTO performance achievements after optimizing code pieces separately. This paper proposes new graph partition algorithm for program call graphs, results of comparing this one with two other methods on SPEC CPU2000 benchmark and implementation of the algorithm in scalable LLVM-based LTO system. The implementation of this approach in LTO system shows 31% link speedup and 3% of performance degradation for 4 threads. The lightweight optimization shows 0,5% speedup for single run in lazy code loading mode.

Highlights

This paper proposes the two different approaches to speed-up program build: making link-time optimization work in parallel and lightweight optimization approach
The former technique is achieved by scaling LTO system. The latter makes link to work faster because of using summaries to manage some of interprocedural optimization passes instead of full IR code in memory
The problem of horizontal LTO system scaling leads to the problem of partition of the large task to several subtasks that can be performed concurrently

Summary

Введение

Оптимизации времени связывания зарекомендовали себя как эффективный инструмент оптимизации программ [1]. Сложность подхода оптимизаций времени связывания заключается в том же, что и его сила: весь код программы находится в памяти. К примеру, сборка состоящего из 36,5 тысяч исходных C/C++ файлов браузера Firefox на компиляторах GCC и LLVM с оптимизацией времени связывания требует от 6 до 34 Гб ОЗУ в зависимости от настроек, и работает от 11 до 26 минут на x86-64 [2]. Данная статья является частью исследования, посвященной разработке масштабируемой системы оптимизации времени связывания. В 2.2 приводятся оценки разбиения, а также обсуждается применимость алгоритмов к графам вызовов программ. В разделе 3 описываются легковесные межпроцедурные оптимизирующие преобразования, поясняется их место в разрабатываемой системе оптимизации времени связывания. В 3.2 проводится краткий обзор существующих оптимизаций времени связывания и обсуждается применимость к ним предложенного метода.

Горизонтальное масштабирование системы оптимизации времени связывания

Определения

Краткий обзор методов разбиения графов

Методы оценки разбиения

Исследования свойств графов вызовов программ

Описание предлагаемого алгоритма

Легковесные оптимизации

Реализация метода разбиения

Реализация оптимизизации удаления мертвых глобальных переменных

Findings

Результаты

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Ускорение оптимизации программ во время связывания

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Proceedings of the Institute for System Programming of the RAS

Lead the way for us

Journal: Proceedings of the Institute for System Programming of the RAS	Publication Date: Jan 1, 2016
License type: cc-by

Similar Papers

Parallel Particle Swarm Optimization with Adaptive Asynchronous Migration Strategy
Zhi-Hui Zhan ...
-
Zhi-Hui Zhan, et. al.Zhi-Hui Zhan ...
01 Jan 2009
01 Jan 2009

IPRESPS - a Programming Environment for Partitioning and Scheduling of Time Critical Tasks
M Ojsteršek ... V Žumer
IFAC Proceedings Volumes | VOL. 25
M Ojsteršek, et. al.M Ojsteršek ... V Žumer
01 Jun 1992
IFAC Proceedings Volumes | VOL. 25

Scrambler: dynamic layout adaptation
Niko Takayesu ... Thu Nguyen
-
Niko Takayesu, et. al.Niko Takayesu ... Thu Nguyen
20 Oct 2016
20 Oct 2016

IPRESPS - A PROGRAMMING ENVIRONMENT FOR PARTITIONING AND SCHEDULING OF TIME CRITICAL TASKS
V Žumer ... H Ojsteršek
Real-Time Programming 1992 | VOL. -
V Žumer, et. al.V Žumer ... H Ojsteršek
01 Jan 1992
Real-Time Programming 1992 | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Ускорение оптимизации программ во время связывания

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Proceedings of the Institute for System Programming of the RAS