GPU Implementation of Bitplane Coding with Parallel Coefficient Processing for High Performance Image Compression

Research output: Contribution to journalArticleResearchpeer-review

15 Citations (Scopus)


The fast compression of images is a requisite in many applications like TV production, teleconferencing, or digital cinema. Many of the algorithms employed in current image compression standards are inherently sequential. High performance implementations of such algorithms often require specialized hardware like field integrated gate arrays. Graphics Processing Units (GPUs) do not commonly achieve high performance on these algorithms because they do not exhibit fine-grain parallelism. Our previous work introduced a new core algorithm for wavelet-based image coding systems. It is tailored for massive parallel architectures. It is called bitplane coding with parallel coefficient processing (BPC-PaCo). This paper introduces the first high performance, GPU-based implementation of BPC-PaCo. A detailed analysis of the algorithm aids its implementation in the GPU. The main insights behind the proposed codec are an efficient thread-to-data mapping, a smart memory management, and the use of efficient cooperation mechanisms to enable inter-thread communication. Experimental results indicate that the proposed implementation matches the requirements for high resolution (4 K) digital cinema in real time, yielding speedups of 30 × with respect to the fastest implementations of current compression standards. Also, a power consumption evaluation shows that our implementation consumes 40 × less energy for equivalent performance than state-of-the-art methods.

Original languageEnglish
Article number7833172
Pages (from-to)2272-2284
Number of pages13
JournalIEEE Transactions on Parallel and Distributed Systems
Issue number8
Publication statusPublished - 1 Aug 2017


  • Image coding
  • SIMD computing
  • compute unified device architecture (CUDA)
  • graphics processing unit (GPU)


Dive into the research topics of 'GPU Implementation of Bitplane Coding with Parallel Coefficient Processing for High Performance Image Compression'. Together they form a unique fingerprint.

Cite this