Using machine learning to partition streaming programs

Zheng Wang,Michael F P O'Boyle

doi:10.1145/2512436

Abstract

Stream-based parallel languages are a popular way to express parallelism in modern applications. The efficient mapping of streaming parallelism to today's multicore systems is, however, highly dependent on the program and underlying architecture. We address this by developing a portable and automatic compiler-based approach to partitioning streaming programs using machine learning. Our technique predicts the ideal partition structure for a given streaming application using prior knowledge learned offline. Using the predictor we rapidly search the program space (without executing any code) to generate and select a good partition. We applied this technique to standard StreamIt applications and compared against existing approaches. On a 4-core platform, our approach achieves 60% of the best performance found by iteratively compiling and executing over 3000 different partitions per program. We obtain, on average, a 1.90× speedup over the already tuned partitioning scheme of the StreamIt compiler. When compared against a state-of-the-art analytical, model-based approach, we achieve, on average, a 1.77× performance improvement. By porting our approach to an 8-core platform, we are able to obtain 1.8× improvement over the StreamIt default scheme, demonstrating the portability of our approach.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Using machine learning to partition streaming programs

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Architecture and Code Optimization

Lead the way for us

Journal: ACM Transactions on Architecture and Code Optimization	Publication Date: Sep 16, 2013
Citations: 43

Similar Papers

Using machine learning to partition streaming programs
Zheng Wang ... Michael F P O'Boyle
ACM Transactions on Architecture and Code Optimization | VOL. 10
Zheng Wang, et. al.Zheng Wang ... Michael F P O'Boyle
01 Sep 2013
ACM Transactions on Architecture and Code Optimization | VOL. 10

Partitioning streaming parallelism for multi-cores
Zheng Wang ... Michael F.P O'Boyle
-
Zheng Wang, et. al.Zheng Wang ... Michael F.P O'Boyle
11 Sep 2010
11 Sep 2010

Power/Performance Modeling and Optimization: Using and Characterizing Machine Learning Applications

-

17 Oct 2018
17 Oct 2018

A Hybrid Approach Based Diet Recommendation System Using ML and Big Data Analytics
Muhib Anwar Lambay ... S Pakkir Mohideen
Journal of Mobile Multimedia | VOL. -
Muhib Anwar Lambay, et. al.Muhib Anwar Lambay ... S Pakkir Mohideen
18 Jul 2022
Journal of Mobile Multimedia | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Using machine learning to partition streaming programs

Abstract

Talk to us

Similar Papers

More From: ACM Transactions on Architecture and Code Optimization