Modern Computer Architectures Research Articles

On any modern computer architecture today, parallelism comes with a modest cost, born from the creation and management of threads or tasks. Today, programmers battle this cost by manually optimizing/tuning their codes to minimize the cost of parallelism without harming its benefit, performance. This is a difficult battle: programmers must reason about architectural constant factors hidden behind layers of software abstractions, including thread schedulers and memory managers, and their impact on performance, also at scale. In languages that support higher-order functions, the battle hardens: higher order functions can make it difficult, if not impossible, to reason about the cost and benefits of parallelism. Motivated by these challenges and the numerous advantages of high-level languages, we believe that it has become essential to manage parallelism automatically so as to minimize its cost and maximize its benefit. This is a challenging problem, even when considered on a case-by-case, application-specific basis. But if a solution were possible, then it could combine the many correctness benefits of high-level languages with performance by managing parallelism without the programmer effort needed to ensure performance. This paper proposes techniques for such automatic management of parallelism by combining static (compilation) and run-time techniques. Specifically, we consider the Parallel ML language with task parallelism, and describe a compiler pipeline that embeds "potential parallelism" directly into the call-stack and avoids the cost of task creation by default. We then pair this compilation pipeline with a run-time system that dynamically converts potential parallelism into actual parallel tasks. Together, the compiler and run-time system guarantee that the cost of parallelism remains low without losing its benefit. We prove that our techniques have no asymptotic impact on the work and span of parallel programs and thus preserve their asymptotic properties. We implement the proposed techniques by extending the MPL compiler for Parallel ML and show that it can eliminate the burden of manual optimization while delivering good practical performance.

Data, its volume, structure, and form of presentation are among the most significant problems in working in the medical field. The probability of error is very high without innovative high-tech data analysis tools. It is easy to miss an important factor that is critical but lost among other, less important information. This work aims to study the proposed parallel gradient boosting algorithm in combination with the Bagging algorithm in the classification of diabetes to achieve greater stability and higher accuracy, reduce computational complexity and improve performance in medicine. The methods of parallelization of the Gradient Boosting algorithm in combination with the Bagging algorithm are investigated in the paper. Performance scores were obtained: approximately 7 using ThreadPoolExecutor and an eight-core computer system and 9.5 based on CUDA technology. Performance indicators that go to the unit are calculated. This, in turn, confirms the effectiveness of the proposed parallel algorithm. Another significant result of the study is improving algorithm accuracy by increasing the number of algorithms in the composition. The problem of diagnosing a patient's diabetes based on specific measurements included in the data set is considered. Detailed analysis and pre-processing of the selected dataset were performed. The parallelization of the proposed algorithm is implemented using the multi-core architecture of modern computers and CUDA technology. The process of learning models and training samples was parallelized. The theoretical estimation of the computational complexity of the offered parallel algorithm is given. A comparison of serial and parallel algorithm execution time using ThreadPoolExecutor when varying the number of threads and algorithms in the composition is presented. And also, the comparative analysis of time expenses at consecutive and parallel execution based on CPU and GPU is carried out.

Modern Computer Architectures Research Articles

Related Topics

Articles published on Modern Computer Architectures

On a Simplified Approach to Achieve Parallel Performance and Portability Across CPU and GPU Architectures

Custom RISC-V architecture incorporating memristive in-memory computing

First-principles algorithms for ship motion simulation based on ARMA and trochoidal wave models

A variational reformulation of molecular properties in electronic-structure theory.

Automatic Parallelism Management

Polymer-Waveguide-Integrated 2D Semiconductor Heterostructures for Optical Communications.

Analisis Sistem Bus USB Dan PCI Pada Organisasi Arsitektur Komputer

ADAPTASI ORGANISASI TERHADAP PERKEMBANGAN ARSITEKTUR KOMPUTER MODERN

CUDA-BASED PARALLELIZATION OF GRADIENT BOOSTING AND BAGGING ALGORITHM FOR DIAGNOSING DIABETES

On the use of a multigrid-reduction-in-time algorithm for multiscale convergence of turbulence simulations

A method of defense against cache timing attack in non-volatile memory

Deep Learning for Edge Computing Applications: A Comprehensive Survey

Computationally Efficient Cellular Automata‐Based Full‐Field Models of Static Recrystallization: A Perspective Review

Fundamentals of Fast Tsunami Wave Parameter Determination Technology for Hazard Mitigation.

Decimal Versus Binary Representation of Numbers in Computers

Combining p-multigrid and Multigrid Reduction in Time methods to obtain a scalable solver for Isogeometric Analysis

Classifying Co-resident Computer Programs Using Information Revealed by Resource Contention

Sequential Monte-Carlo algorithms for Bayesian model calibration – A review and method comparison✰

Sharing non‐cache‐coherent memory with bounded incoherence

Parallelization of Finding the Current Coordinates of the Lidar Based on the Genetic Algorithm and OpenMP Technology

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Modern Computer Architectures Research Articles

Related Topics

Articles published on Modern Computer Architectures

On a Simplified Approach to Achieve Parallel Performance and Portability Across CPU and GPU Architectures

Custom RISC-V architecture incorporating memristive in-memory computing

First-principles algorithms for ship motion simulation based on ARMA and trochoidal wave models

A variational reformulation of molecular properties in electronic-structure theory.

Automatic Parallelism Management

Polymer-Waveguide-Integrated 2D Semiconductor Heterostructures for Optical Communications.

Analisis Sistem Bus USB Dan PCI Pada Organisasi Arsitektur Komputer

ADAPTASI ORGANISASI TERHADAP PERKEMBANGAN ARSITEKTUR KOMPUTER MODERN

CUDA-BASED PARALLELIZATION OF GRADIENT BOOSTING AND BAGGING ALGORITHM FOR DIAGNOSING DIABETES

On the use of a multigrid-reduction-in-time algorithm for multiscale convergence of turbulence simulations

A method of defense against cache timing attack in non-volatile memory

Deep Learning for Edge Computing Applications: A Comprehensive Survey

Computationally Efficient Cellular Automata‐Based Full‐Field Models of Static Recrystallization: A Perspective Review

Fundamentals of Fast Tsunami Wave Parameter Determination Technology for Hazard Mitigation.

Decimal Versus Binary Representation of Numbers in Computers

Combining p-multigrid and Multigrid Reduction in Time methods to obtain a scalable solver for Isogeometric Analysis

Classifying Co-resident Computer Programs Using Information Revealed by Resource Contention

Sequential Monte-Carlo algorithms for Bayesian model calibration – A review and method comparison✰

Sharing non‐cache‐coherent memory with bounded incoherence

Parallelization of Finding the Current Coordinates of the Lidar Based on the Genetic Algorithm and OpenMP Technology