Abstract

Branches are one of the main impediments to achieving maximum instruction-level parallelism. Branch prediction along with speculative execution is the main, incumbent methodology to hide branch stalls. Branch decoupled architectures offer an alternate way of reducing branch penalty. This paper presents a combination of the two methodologies, to show that they can coexist and may provide better performance than processors with conventional branch prediction techniques, but equivalent hardware resources. A trace based evaluation is presented, and with three different branch prediction techniques, it is shown that over twelve of the SPEC'95 CPU benchmarks, speculative branch decoupling performs significantly better than branch prediction with speculative execution, in all the cases. Since a trace based approach has its limitations, the study is a limited one, but makes a strong case for further research in this area.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.