Abstract

Color space conversion (CSC) is an important kernel in the area of image and video processing applications including video compression. CSC is a compute-intensive time-consuming operation that consumes up to 40% of processing time of a highly optimised decoder. Several hardware and software implementations for CSC have been found. Hardware implementations can achieve a higher performance compared with software-only solutions. However, the flexibility of software solutions is desirable for various functional requirements and faster time to market. Multicore processors, especially programmable GPUs, provide an opportunity to increase the performance of CSC by exploiting data parallelism. In this paper, we present a novel approach for efficient implementation of color space conversion. The proposed approach has been implemented and verified using computed unified device architecture (CUDA) on graphics hardware. Our experiments results show that the speedup of up to17×can been obtained.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.