Loop Unrolling and Data-Path Generation of Sliding-Window Operation
Window operations which are computationally intensive and data intensive are frequently used in image compression,pattern recognition and digital signal processing.Reconfigurable hardware boards provide a convenient and flexible solution to speed up these algorithms.Based on a memory and data schedule method as well as the method of data-path generation,this paper studies the effect of loop unrolling on the area,clock speed and throughput for sliding window operations.The results indicate that due to the unique design of the compilation framework,inner loop unrolling makes the controllers become more complicated than outer loop unrolling and increase more requirements of areas at the same time.However,outer loop unrolling demands more memory elements to keep the reused data.The clock speed begins to decrease when the number of RAM modules extends to a certain size,and the throughput increase in different degrees for different operations.