Abstract

Developing embedded parallel applications efficiently in modern single-chip many-core architectures is challenging. We present a novel methodology to facilitate crucial issues of parallel software development such as performance evaluation, speedup and bottleneck analysis, and system verification by taking the advantages of exploring many-core platforms in different abstraction levels altogether. To demonstrate our methodology, a design framework is proposed, consisting of a scalable many-core processor architecture and a hardware-independent software layer. Based on the scalable architecture, the prototype platform is also presented in Electronic System Level (ESL) and Register Transfer Level (RTL). The platform integrates sixteen Processing Elements (PEs) and a 4-by-4 mesh-based network with external memory. Our hardware-independent software layer aims at realistic parallel applications in the baremetal environment, i.e., without an Operating System (OS). With the on-chip communication library and system utilities, software built on top of the fast ESL many-core platform can be executed seamlessly on the RTL one. The analysis shows that our ESL simulation model provides relatively accurate performance metrics as compared with the RTL implementation, with up to 773.8 times faster in terms of simulation speed. In addition, we showcase the study of speedup and bottleneck analysis, which justifies the effectiveness of our methodology for embedded applications in many-core architectures.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call