Abstract
CP-PACS (Computational Physics by Parallel Array Computer System) is a massively parallel processing system with 2048 node processors for large scale scientific calculations. On a node processor of CP-PACS, there is a special hardware feature called PVP-SW (Pseudo Vector Processor based on Slide Window), which realizes an efficient vector processing on a superscalar processor without depending on the cache. The authors present the effectiveness of PVP-SW by performance measurement on a single node processor for the LINPACK benchmark. Utilizing loop unrolling techniques and the block-TLB feature, the PVP-SW function improves the basic performance up to 3.5 times faster for 1000/spl times/1000 LINPACK. This performance corresponds to the 73% of theoretical peak.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.