Production Compilers Research Articles

Abstract: The resurgence of machine learning has increased the demand for high-performance basic linear algebra subroutines (BLAS), which have long depended on libraries to achieve peak performance on commodity hardware. High-performance BLAS implementations rely on a layered approach that consists of tiling and packing layers, which (re)organize data, and micro kernels that perform the actual computations. The algorithm for the tiling and packing layers is target independent but is parameterized by the memory hierarchy and register-file size. Creating high-performance micro kernels requires significant development effort to write tailored assembly code for each architecture. This hand-optimization task is complicated by the recent introduction of matrix engines (Matrix Multiply Assist, MMA; Advanced Matrix eXtensions, AMX; and Matrix Extensions, ME) to deliver high-performance matrix operations. This article presents a compiler-only alternative to the use of high-performance libraries by incorporating, to the best of our knowledge and for the first time, the automatic generation of the layered approach into LLVM, a production compiler. The modular design of the algorithm, such as the use of LLVM's matrix-multiply intrinsic as a clear interface between the tiling and packing layers and the micro kernel, makes it easy to retarget the code generation to multiple accelerators. The parameterization of the tiling and packing layers is demonstrated in the generation of code for the MMA unit on IBM's POWER10. This article also describes an algorithm that lowers the matrix-multiply intrinsic to the MMA unit. The use of intrinsics enables a comprehensive performance study. In processors without hardware matrix engines, the tiling and packing layers deliver performance up to […] faster (Intel, for small matrices) and more than […] faster (POWER9, for large matrices) than PLuTo, a widely used polyhedral optimizer. The performance also approaches that of high-performance libraries: it is only […] slower than OpenBLAS and on par with Eigen for large matrices. With MMA on POWER10, this solution is, for large matrices, over […] faster than the vector-extension solution, matches Eigen performance, and achieves up to […] of BLAS peak performance.
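The layered approach described in this abstract can be illustrated with a minimal sketch. It is not the article's implementation: the function names (`pack_tile`, `micro_kernel`, `tiled_matmul`) and the default tile sizes are hypothetical, and the pure-Python micro kernel only stands in for the step that the article lowers through LLVM's matrix-multiply intrinsic. In a real setting the tile sizes would be chosen from the memory hierarchy and register-file size.

```python
# Illustrative sketch only: all names and tile sizes are assumptions.

def pack_tile(M, r0, c0, rows, cols):
    """Packing layer: copy a rows x cols tile of M starting at (r0, c0)
    into a contiguous row-major buffer, zero-padding past the matrix edge."""
    buf = [0.0] * (rows * cols)
    for i in range(rows):
        for j in range(cols):
            if r0 + i < len(M) and c0 + j < len(M[0]):
                buf[i * cols + j] = M[r0 + i][c0 + j]
    return buf


def micro_kernel(a_tile, b_tile, acc, tm, tk, tn):
    """Micro kernel: accumulate (tm x tk) * (tk x tn) into acc (tm x tn).
    Stands in for a hardware-specific kernel or a matrix-multiply intrinsic."""
    for i in range(tm):
        for p in range(tk):
            a = a_tile[i * tk + p]
            for j in range(tn):
                acc[i * tn + j] += a * b_tile[p * tn + j]


def tiled_matmul(A, B, tm=4, tk=4, tn=4):
    """Tiling layer: walk the iteration space in tiles, pack operands,
    call the micro kernel, then write the accumulated tile back to C."""
    m, k, n = len(A), len(B), len(B[0])
    C = [[0.0] * n for _ in range(m)]
    for i0 in range(0, m, tm):
        for j0 in range(0, n, tn):
            acc = [0.0] * (tm * tn)
            for p0 in range(0, k, tk):
                a_tile = pack_tile(A, i0, p0, tm, tk)
                b_tile = pack_tile(B, p0, j0, tk, tn)
                micro_kernel(a_tile, b_tile, acc, tm, tk, tn)
            # Copy back only the part of the tile that lies inside C.
            for i in range(min(tm, m - i0)):
                for j in range(min(tn, n - j0)):
                    C[i0 + i][j0 + j] = acc[i * tn + j]
    return C
```

Because the micro kernel only sees fixed-size packed tiles, swapping it for a different target-specific kernel would leave the tiling and packing code unchanged, which is the retargeting property the abstract attributes to the modular design.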

Learning media influence efforts to improve student learning achievement. The choice of learning media has begun to shift from traditional media to modern, up-to-date media that are more practical, attractive, creative, and innovative. This research was conducted to develop writing materials and learning media for Indonesian language learning using Microsoft PowerPoint for class VII, semester 2 students of SMP Negeri 2 Pegajahan. The data in this development research are both qualitative and quantitative. The research uses the research and development (R&D) model of Borg and Gall. The purpose of the study is to produce a learning media product as an effort to develop writing materials. To that end, the researcher follows seven steps in developing the learning media, namely analyzing the needs of teachers and students, developing the product with Microsoft PowerPoint, validation by media experts, material experts, and Indonesian language teachers, a first phase of revision, field trials, and a final revision. The quality of the learning media product is indicated by the average validation score of 95% from Indonesian language teachers. The average percentage of the assessment results is 87.18%, which falls in the "very good" category. The quality of the product is also reflected in excellent student feedback. The product of this research is a set of learning materials and media designed for class VII SMP in the second semester. It covers eight Competency Standards, which are translated into seventeen Basic Competencies and delivered in twelve lessons. The twelve lessons are structured and integrated under one main menu. Each media unit is arranged systematically and includes apperception, competency standards and basic competencies, indicators, materials, sample questions, and competency tests, combining text, audio recordings, videos, and animations. The results of this study are highly relevant to the 2013 curriculum because they accord with its standards for an interactive, inspiring, fun, creative, challenging, and motivating learning process for students. Further research is needed to determine the effectiveness of this media and its influence on student achievement.

Articles published on Production Compilers

Fast matrix multiplication via compiler‐only layered data reorganization and intrinsic lowering

The road not taken: exploring alias analysis based optimizations missed by the compiler

DEVELOPMENT OF INDONESIAN LEARNING MEDIA BY USING MICROSOFT POWERPOINT FOR STUDENTS OF CLASS VII SMP NEGERI 2

On the Transformation Optimization for Stencil Computation

Reconciling optimization with secure compilation

FPL: fast Presburger arithmetic through transprecision

Not so fast: understanding and mitigating negative impacts of compiler optimizations on code reuse gadget sets

Optimizing smart manufacturing systems by extending the smart products paradigm to the beginning of life

Natural Product Inhibitors of Cyclooxygenase (COX) Enzyme: A Review on Current Status and Future Perspectives.

Optimization Methods for Generalized Tensor Contractions

Building a Polyhedral Representation from an Instrumented Execution

DeepFuzz: Automatic Generation of Syntax Valid C Programs for Fuzz Testing

Crellvm: verified credible compilation for LLVM

Polyhedral auto-transformation with no integer linear programming

Cleaning Product Ingredient Safety: What Is the Current State of Availability of Information Regarding Ingredients in Products and Their Function?

Automatic Contract Insertion with CCBot

Miniphases: compilation using modular and efficient tree transformations

Control-Flow Integrity

Specifying and executing optimizations for generalized control flow graphs

Finding deep compiler bugs via guided stochastic program mutation
