One-for-All: Grouped Variation Network Based Fractional Interpolation in Video Coding


Abstract

Fractional interpolation is used to provide sub-pixel level references for motion compensation in the inter prediction of video coding, which attempts to remove temporal redundancy in video sequences. Traditional handcrafted fractional interpolation filters face the challenge of modeling discontinuous regions in videos, while existing deep learning based methods are either designed for a single quantization parameter (QP), only generate half-pixel samples, or need to train a model for each subpixel position. In this paper, we present a one-for-all fractional interpolation method based on grouped variation convolutional neural network (GVCNN). Our method can deal with video frames coded using different QPs and is capable of generating all sub-pixel positions at one sub-pixel level. Also, by predicting variations between integer-position pixels and sub-pixels, our network offers more expressive power. Moreover, we perform specific measurements in training data generation to simulate practical situations in video coding, including blurring the downsampled sub-pixel samples to avoid aliasing effects and coding integer pixels to simulate reconstruction errors. In addition, we analyze the impact of the size of blur kernels theoretically. Experimental results verify the efficiency of GVCNN. Compared with HEVC, our method achieves 2:2% in bit saving on average and up to 5:2% under low-delay P configuration.

Framework

Figure. 1. Framework of the proposed GVCNN. The network first extracts feature maps from the integer-position sample. Then the group variations that identify the differences between different sub-pixel position samples and the integer-position sample are inferred using the same feature maps. Final results of sub-pixel position samples are naturally obtained by adding the variations back to the integer-position sample.

Results

Table 1: BD-rate reduction of the proposed method compared to HEVC.

Download

  • Paper
  • Code
  • Citation

    @ARTICLE{GVCNN,    author={J. Liu and S. Xia and W. Yang and M. Li and D. Liu},    journal={IEEE Transactions on Image Processing},    title={One-for-All: Grouped Variation Network Based Fractional Interpolation in Video Coding},    year={2019},    }