gpu debayer benchmarkBenchmarks for Debayer Algorithms on CPU and GPU

Here we presents image demosaicing results for widely used debayer algorithms and compare CPU and GPU implementations.


  • Core i7-4770, 3.6 GHz, DDR-III 8 GB
  • NVIDIA GeForce GTX 1080, driver 368.39
  • Windows-7 (64-bit)


  • Fastvideo GPU Debayer (CPU and GPU)
    • HQLI (High Quality Linear Interpolation algorithm)
    • DFPD (Directional Filtering and a Posteriori Decision algorithm)
  • Intel IPP-7.1 image processing library (CPU)
    • ippiCFAToRGB_8u_C1C3R (AN3 algorithm, multithreaded)
    • ippiDemosaicAHD_8u_C1C3R (AHD algorithm, multithreaded)
  • OpenGL software on GeForce 9800 GX2 (according to McGuire article) for HQLI algorithm

We consider the following debayer algorithms: bilinear (AN3 from Intel IPP library), AHD (Adaptive Homogenuity Direction, from Intel IPP), HQLI (High Quality Linear Interpolation), DFPD (Directional Filtering and a Posteriori Decision).

Debayer Benchmarks on CPU and GPU

In that testing we measured computation time with and without host-to-device and device-to-host transfers to compare throughput for the above algorithms at CPU and GPU. Here we present averaged results for 8-bit images with 4K resolution (3840×2160). PSNR and SSIM results are calculated for standard Kodak data set.

  AN3 (Bilinear) algorithm HQLI algorithm AHD algorithm DFPD algorithm
GPU, Fastvideo Debayer, only computations (Mpix/s) -- 34600 -- 13000
GPU, OpenGL, only computations (Mpix/s) -- 4300 -- --
CPU, Fastvideo Debayer (Mpix/s) -- 82 -- 20
CPU, Intel IPP-7.1 Debayer (Mpix/s) 630 -- 12 --
PSNR (in decibels) 30.4 36 37.4 39
SSIM (for RGB data) 0.873 0.965 0.968 0.978

Popular RAW converters consider AHD (AHDMF) algorithm to be industry standard. We have found how to get better results and not at the expense of performance. Here you can see moire patterns for these algorithms.

     Home                   Contacts                 Site Map
GPU Image Processing