Abstract
Parallel applications usually execute using the maximum number of threads allowed by the available hardware at hand to maximize performance. However, this approach may not be the best when it comes to energy efficiency and may even lead to performance decrease in some particular cases. Even though many solutions have already been proposed to find and adapt the best number of threads of a given parallel application to achieve a specific goal, which may be performance, energy-saving or Energy-Delay Product (EDP), they focus on General Purpose Processors (GPP). Differently from previous works, the current one aims to show that there is also space for EDP optimization when it comes to Embedded Systems, which usually have a lower number of cores compared to GPP, and have different characteristics concerning the micro-architecture and memory hierarchy. Through experiments on a real quad-core embedded processor, this work demonstrates that it is possible to improve the EDP (Energy-Delay Product) by up to 8.13% over the default configuration of 4 threads in OpenMP applications by adapting the number of threads in runtime, without any changes in the hardware or compiler.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.