Video Compression through Image Interpolation

@inproceedings{Wu2018VideoCT,
  title={Video Compression through Image Interpolation},
  author={Chao-Yuan Wu and Nayan Singhal and Philipp Kr{\"a}henb{\"u}hl},
  booktitle={ECCV},
  year={2018}
}
An ever increasing amount of our digital communication, media consumption, and content creation revolves around videos. We share, watch, and archive many aspects of our lives through them, all of which are powered by strong video compression. Traditional video compression is laboriously hand designed and hand optimized. This paper presents an alternative in an end-to-end deep learning codec. Our codec builds on one simple idea: Video compression is repeated image interpolation. It thus benefits… Expand
Learning Image and Video Compression Through Spatial-Temporal Energy Compaction
TLDR
This paper presents an image compression architecture using a convolutional autoencoder, and then generalizes image compression to video compression, by adding an interpolation loop into both encoder and decoder sides, to achieve higher image compression performance. Expand
Neural Inter-Frame Compression for Video Coding
TLDR
This work presents an inter-frame compression approach for neural video coding that can seamlessly build up on different existing neural image codecs and proposes to compute residuals directly in latent space instead of in pixel space to reuse the same image compression network for both key frames and intermediate frames. Expand
DVC: An End-To-End Deep Video Compression Framework
TLDR
This paper proposes the first end-to-end video compression deep model that jointly optimizes all the components for video compression, and shows that the proposed approach can outperform the widely used video coding standard H.264 in terms of PSNR and be even on par with the latest standard MS-SSIM. Expand
Learned image and video compression with deep neural networks
TLDR
This tutorial will cover both the traditional data coding techniques and the popular learning based visual data compression algorithms, which will help the audiences with different backgrounds learn the recent progresses in this emerging research area. Expand
Deep Generative Video Compression
TLDR
This work proposes an end-to-end, deep generative modeling approach to compress temporal sequences with a focus on video that builds upon variational autoencoder models for sequential data and combines them with recent work on neural image compression. Expand
Deep Generative Video Compression
The usage of deep generative models for image compression has led to impressive performance gains over classical codecs while neural video compression is still in its infancy. Here, we propose anExpand
Deep Generative Video Compression
The usage of deep generative models for image compression has led to impressive performance gains over classical codecs while neural video compression is still in its infancy. Here, we propose anExpand
Learned Video Compression
TLDR
This work presents a new algorithm for video coding, learned end-to-end for the low-latency mode, which outperforms all existing video codecs across nearly the entire bitrate range, and is the first ML-based method to do so. Expand
Deep Frame Interpolation for Video Compression
TLDR
This paper presents a deep frame interpolation network for video compression aiming at solving the previous limitations, i.e. able to cope with all types of geometrical deformations by providing a dense motion compensation. Expand
End-to-End Learning of Video Compression using Spatio-Temporal Autoencoders
TLDR
This work formalizes it as the rate-distortion optimization of a single spatio-temporal autoencoder, by jointly learning a latent-space projection transform, and a synthesis transform for low-bitrate video compression, and shows competitive performance against standard codecs. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 34 REFERENCES
MPEG: a video compression standard for multimedia applications
TLDR
Design of the MPEG algorithm presents a difficult challenge since quality requirements demand high compression that cannot be achieved with only intraframe coding, and the algorithm’s random access requirement is best satisfied with pure intraframes coding. Expand
Real-Time Adaptive Image Compression
TLDR
A machine learning-based approach to lossy image compression which outperforms all existing codecs, while running in real-time, and supplementing the approach with adversarial training specialized towards use in a compression setting. Expand
Lossy Image Compression with Compressive Autoencoders
TLDR
It is shown that minimal changes to the loss are sufficient to train deep autoencoders competitive with JPEG 2000 and outperforming recently proposed approaches based on RNNs, and furthermore computationally efficient thanks to a sub-pixel architecture, which makes it suitable for high-resolution images. Expand
Compressed Video Action Recognition
TLDR
This work proposes to train a deep network directly on the compressed video, using H.264, HEVC, etc., which has a higher information density, and found the training to be easier. Expand
Learning to Inpaint for Image Compression
TLDR
Predicting the original image data from residuals in a multi-stage progressive architecture facilitates learning and leads to improved performance at approximating the original content and learning to inpaint before performing compression reduces the amount of information that must be stored to achieve a high-quality approximation. Expand
Video Frame Synthesis Using Deep Voxel Flow
TLDR
This work addresses the problem of synthesizing new video frames in an existing video, either in-between existing frames (interpolation), or subsequent to them (extrapolation), by training a deep network that learns to synthesize video frames by flowing pixel values from existing ones, which is called deep voxel flow. Expand
Learning Binary Residual Representations for Domain-specific Video Streaming
TLDR
This work proposes a novel video compression pipeline that first applies H.264 to compress domain-specific videos, then trains a novel binary autoencoder to encode the leftover domain- specific residual information frame-by-frame into binary representations. Expand
Video Codec Design: Developing Image and Video Compression Systems
TLDR
Video Codec Design sets out to de-mystify the subject of video coding and present a practical, design-based approach to this emerging field, providing communications engineers, system designers, researchers and technical managers with an essential handbook to image and video compression technology. Expand
Super SloMo: High Quality Estimation of Multiple Intermediate Frames for Video Interpolation
TLDR
This work proposes an end-to-end convolutional neural network for variable-length multi-frame video interpolation, where the motion interpretation and occlusion reasoning are jointly modeled. Expand
Improved Lossy Image Compression with Priming and Spatially Adaptive Bit Rates for Recurrent Networks
TLDR
A method for lossy image compression based on recurrent, convolutional neural networks that outperforms BPG, WebP, JPEG2000, and JPEG as measured by MS-SSIM is proposed and it is shown that training with a pixel-wise loss weighted by SSIM increases reconstruction quality according to multiple metrics. Expand
...
1
2
3
4
...