Abstract
As application complexity continues to grow, multicore processors have proved to be an effective way to meet the ever-increasing processing demand across the industry. The Master/Slave model, the Data Flow model, and the OpenMP model are the three dominant models for parallel programming. This paper briefly discusses the first two models and focuses on the OpenMP model. It also studies several factors that affect the execution performance of OpenMP programs, such as the number of threads, the scheduling strategy, and the load balance. The paper presents a method that uses the OpenMP model to implement image edge detection on the TMS320C6678 DSP. The experimental results show that the OpenMP model offers better scalability and flexibility than the Master/Slave model and the Data Flow model. The best performance is obtained when the number of threads equals the number of cores available on the platform. With all eight cores of the TMS320C6678 DSP in use simultaneously, edge detection on a 1024×768-pixel image takes just 6.192 ms, which is 32.10% less time than the Master/Slave model needs. Furthermore, as the number of cores increases from 1 to 8, the execution time decreases and the speedup approximately conforms to Gustafson's law. With 8 cores, the speedup reaches 7.233.
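To make the approach concrete, the following is a minimal sketch of row-parallel edge detection with OpenMP. The abstract does not name the edge operator or give the authors' code, so the Sobel operator, the function name `sobel_edges`, the static schedule, and the 8-bit grayscale row-major layout are all assumptions made for illustration.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical helper, not the paper's implementation: Sobel edge
 * detection on an 8-bit grayscale image stored row-major in src.
 * The result is written to dst, which must hold width * height bytes. */
void sobel_edges(const uint8_t *src, uint8_t *dst, int width, int height)
{
    assert(src != NULL && dst != NULL && width >= 3 && height >= 3);

    /* Every output row reads only from src, so rows are independent.
     * A static schedule hands each thread an equal block of rows
     * (assumed here; the paper studies scheduling as a tunable factor). */
    #pragma omp parallel for schedule(static)
    for (int y = 0; y < height; ++y) {
        for (int x = 0; x < width; ++x) {
            if (y == 0 || y == height - 1 || x == 0 || x == width - 1) {
                dst[y * width + x] = 0; /* border: no full 3x3 neighbourhood */
                continue;
            }
            const uint8_t *p = src + y * width + x;
            /* Horizontal and vertical Sobel responses. */
            int gx = -p[-width - 1] + p[-width + 1]
                     - 2 * p[-1]    + 2 * p[1]
                     - p[width - 1] + p[width + 1];
            int gy = -p[-width - 1] - 2 * p[-width] - p[-width + 1]
                     + p[width - 1] + 2 * p[width]  + p[width + 1];
            /* |gx| + |gy| approximates the gradient magnitude; clamp to 8 bits. */
            int mag = abs(gx) + abs(gy);
            dst[y * width + x] = (uint8_t)(mag > 255 ? 255 : mag);
        }
    }
}
```

The number of threads is left to the OpenMP runtime here; it can typically be set through the `OMP_NUM_THREADS` environment variable or a `num_threads` clause, which is how the one-thread-per-core configuration described above would be selected. Without OpenMP support enabled, the pragma is ignored and the loop runs serially with the same result.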