Thread Assignment Research Articles

Achieving high performance in many multi-server systems (e.g., web hosting center, cloud) requires finding a good assignment of worker threads to servers and also effectively allocating each server’s resources to its assigned threads. The assignment and allocation components of this problem have been studied extensively but largely separately in the literature. In this paper, we introduce the <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">assign and allocate (AA)</i> problem, which seeks to simultaneously find an assignment and allocation that maximizes the total utility of the threads. Assigning and allocating the threads together can result in substantially better overall utility than performing the steps separately, as is traditionally done. We model each thread by a utility function giving its performance as a function of its assigned resources. We first prove that the AA problem is NP-hard. We then present a <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$2 (\sqrt {2}-1) > 0.828$ </tex-math></inline-formula> factor approximation algorithm for concave utility functions, which runs in <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$O(mn^{2} + n (\log mC)^{2})$ </tex-math></inline-formula> time for <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$n$ </tex-math></inline-formula> threads and <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$m$ </tex-math></inline-formula> servers with <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$C$ </tex-math></inline-formula> amount of resources each. We also give a faster algorithm with the same approximation ratio and <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$O(n (\log mC)^{2})$ </tex-math></inline-formula> time complexity. We then extend the problem to two more general settings. First, we consider threads with nonconcave utility functions, and give a 1/2 factor approximation algorithm. Next, we give an algorithm for threads using multiple types of resources, and show the algorithm achieves good empirical performance. We conduct extensive experiments to test the performance of our algorithms on threads with both synthetic and realistic utility functions, and find that they achieve over 92% of the optimal utility on average. We also compare our algorithms with a number of practical heuristics, and find that our algorithms achieve up to 9 times higher total utility.

Read full abstract

The introduction of multicore/multithreaded processors, comprised of a large number of hardware contexts (virtual CPUs) that share resources at multiple levels, has made process scheduling, in particular assignment of running threads to available hardware contexts, an important aspect of system performance. Nevertheless, thread assignment of applications running on state-of-the art processors is an NP-complete problem. Over the years, numerous studies have proposed heuristic-based algorithms for thread assignment. Since the thread assignment problem is intractable, it is in general impossible to know the performance of the optimal assignment, so the room for improvement of a given algorithm is also unknown. It is therefore hard to decide whether to invest more effort and time to improve an algorithm that may already be close to optimal. In this paper, we present a statistical approach to the thread assignment problem. First, we present a method that predicts the performance of the optimal thread assignment, based on the observed performance of each thread assignment in a random sample. The method is based on Extreme Value Theory (EVT), a branch of statistics that analyses extreme deviations from the population mean. We also propose sample pruning, a method that significantly reduces the time required to apply the statistical method by reducing the number of candidate solutions that need to be measured. Finally, we show that, if no suitable heuristic-based algorithm is available, a sample of several thousand random thread assignments is enough to obtain, with high confidence, an assignment with performance close to optimal. The presented approach is architecture and application independent, and it can be used to address the thread assignment problem in various domains. It is especially well suited for systems in which the workload seldom changes. An example is network systems, which typically provide a constant set of services that are known in advance, with network applications performing a similar processing algorithm for each packet in the system. In this paper, we validate our methods with an industrial case study for a set of multithreaded network applications on an UltraSPARC T2 processor. This article is an extension of our previous work [44] , which was published in Proceedings of 17th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS-2012).

Read full abstract

Thread Assignment Research Articles

Related Topics

Articles published on Thread Assignment

Revisiting thread configuration of SpMV kernels on GPU: A machine learning based approach

Parallel Overlapping Community Detection Algorithm on GPU

A Novel Mutual Insurance Model for Hedging Against Cyber Risks in Power Systems Deploying Smart Technologies

Dynamic thread mapping for power-efficient many-core systems under performance constraints

Utility Optimal Thread Assignment and Resource Allocation in Multi-Server Systems

Parallel Louvain Community Detection Algorithm Based on Dynamic Thread Assignment on Graphic Processing Unit

A DFT-Based Running Time Prediction Algorithm for Web Queries

Efficiency in Parallel Computing of FDS Model for Compartment Fire Simulation: Shared Memory System

Bulk execution of the dynamic programming for the optimal polygon triangulation problem on the GPU

Adaptive parallel Louvain community detection on a multicore platform

Hybrid CPU-GPU scheduling and execution of tree traversals

Thread Assignment in Multicore/Multithreaded Processors: A Statistical Approach

Tumbler

Energy-Efficient Thread Assignment Optimization for Heterogeneous Multicore Systems

Thread Assignment of Multithreaded Network Applications in Multicore/Multithreaded Processors

Data rate based adaptive thread assignment solution for combating the SlowPOST denial of service attack

Auto-Tuning of Thread Assignment for Matrix-Vector Multiplication on GPUs

Optimal task assignment in multithreaded processors

Optimal task assignment in multithreaded processors

Task Scheduling Based On Thread Essence and Resource Limitations

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Thread Assignment Research Articles

Related Topics

Articles published on Thread Assignment

Revisiting thread configuration of SpMV kernels on GPU: A machine learning based approach

Parallel Overlapping Community Detection Algorithm on GPU

A Novel Mutual Insurance Model for Hedging Against Cyber Risks in Power Systems Deploying Smart Technologies

Dynamic thread mapping for power-efficient many-core systems under performance constraints

Utility Optimal Thread Assignment and Resource Allocation in Multi-Server Systems

Parallel Louvain Community Detection Algorithm Based on Dynamic Thread Assignment on Graphic Processing Unit

A DFT-Based Running Time Prediction Algorithm for Web Queries

Efficiency in Parallel Computing of FDS Model for Compartment Fire Simulation: Shared Memory System

Bulk execution of the dynamic programming for the optimal polygon triangulation problem on the GPU

Adaptive parallel Louvain community detection on a multicore platform

Hybrid CPU-GPU scheduling and execution of tree traversals

Thread Assignment in Multicore/Multithreaded Processors: A Statistical Approach

Tumbler

Energy-Efficient Thread Assignment Optimization for Heterogeneous Multicore Systems

Thread Assignment of Multithreaded Network Applications in Multicore/Multithreaded Processors

Data rate based adaptive thread assignment solution for combating the SlowPOST denial of service attack

Auto-Tuning of Thread Assignment for Matrix-Vector Multiplication on GPUs

Optimal task assignment in multithreaded processors

Optimal task assignment in multithreaded processors

Task Scheduling Based On Thread Essence and Resource Limitations