Методы оптимизации обобщенных тензорных сверток

doi:10.14529/cmse200202

Abstract

Tensor contraction is one of major operations defined in tensor calculus, a separate branch of mathematics, which become a fundamental language of theory of relativity, mechanics, electrodynamics, and solid state physics. Effective implementation of tensor contraction is of considerable practical significance for such areas as solving of mathematical physics problems, machine learning, spectral element methods, quantum chemistry, data mining, and high performance computing. In the last twenty years, the number of optimization methods for tensor contraction has increased and continues to grow. In this article, the author reviews widespread approaches for optimization of tensor contraction, which are used on single processor as well as multiprocessor systems with distributed memory. The review contains the description of methods for optimization of matrix and matrix-vector multiplications, important particular cases of tensor contraction, which are used as a base for the most tensor contraction optimizations. The described optimizations can be applied during program compilation performed by production compilers. The information provided in this work could be useful for systematizing knowledge.

Highlights

Развитие вычислений с тензорами привели к созданию «Тензорные исчисления», отдельного активно развивающегося раздела математики, использующегося почти во всех областях механики, теоретической физики и прикладной математики
Tensor contraction is one of major operations defined in tensor calculus, a separate branch of mathematics, which become a fundamental language of theory of relativity, mechanics, electrodynamics, and solid state physics
Effective implementation of tensor contraction is of considerable practical significance for such areas as solving of mathematical physics problems, machine learning, spectral element methods, quantum chemistry, data mining, and high performance computing

Summary

Оптимизация MMM и MVM

В силу широкой распространенности и практической значимости оптимизация MMM и MVM является отдельным предметом изучения высокопроизводительных вычислений [9]. В ходе ее работы алгоритм оценивает параметры целевой архитектуры, такие как, например, количество промахов кэша для данной реализации. Нахождение значений параметров программы, для которой выполнено аналитическое моделирование, и известны технические характеристики целевого процессора, выполняется за постоянное время и, вследствие этого, может использоваться в условиях ограниченного времени [28, 35]. В качестве примера модели гипотетического процессора рассмотрим модель, используемую в фреймворке BLIS для нахождения значений параметров реализации MMM [17]. Как и в случае MMM, аналитическое моделирование может применяться для оптимизации MVM с целью достижения производительности кода, оптимизированного вручную [8, 42]. POCA использует эвристический алгоритм для распределения регистров процессора на основе анализа графа зависимостей и особенностей целевой архитектуры, таких как количество векторных регистров и количество тактов, требуемых для выполнения отдельных векторных инструкций микроядра

Оптимизация TC

TensorFlow

OPTIMIZATION METHODS FOR GENERALIZED TENSOR CONTRACTION

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Методы оптимизации обобщенных тензорных сверток

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Bulletin of the South Ural State University. Series "Computational Mathematics and Software Engineering"

Lead the way for us

Journal: Bulletin of the South Ural State University. Series "Computational Mathematics and Software Engineering"	Publication Date: May 1, 2020
License type: cc-by

Similar Papers

Optimizing Tensor Contractions in CCSD(T) for Efficient Execution on GPUs
Jinsung Kim ... Rohit Kumar Srivastava
-
Jinsung Kim, et. al.Jinsung Kim ... Rohit Kumar Srivastava
12 Jun 2018
12 Jun 2018

A Communication-Optimal Framework for Contracting Distributed Tensors
Samyam Rajbhandari ... Akshay Nikam
-
Samyam Rajbhandari, et. al.Samyam Rajbhandari ... Akshay Nikam
01 Nov 2014
01 Nov 2014

Optimizing tensor contraction expressions for hybrid CPU-GPU execution
...
Cluster Computing | VOL. -
, et. al. ...
01 Mar 2013
Cluster Computing | VOL. -

Optimizing tensor contraction expressions for hybrid CPU-GPU execution
Wenjing Ma ... Oreste Villa
Cluster Computing | VOL. 16
Wenjing Ma, et. al.Wenjing Ma ... Oreste Villa
24 Nov 2011
Cluster Computing | VOL. 16

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Методы оптимизации обобщенных тензорных сверток

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Bulletin of the South Ural State University. Series "Computational Mathematics and Software Engineering"