Video Compression - From Concepts to the H.264/AVC Standard

  title={Video Compression - From Concepts to the H.264/AVC Standard},
  author={Gary J. Sullivan and Thomas Wiegand},
  journal={Proceedings of the IEEE},
Over the last one and a half decades, digital video compression technologies have become an integral part of the way we create, communicate, and consume visual information. In this paper, techniques for video compression are reviewed, starting from basic concepts. The rate-distortion performance of modern video compression schemes is the result of an interaction between motion representation techniques, intra-picture prediction techniques, waveform coding of differences, and waveform coding of… 

Figures from this paper

Comparative Performance Issues with H.264 vs H.265

This work presents a comparative analysis between these two standards and represents the more reduction in size of H.265 video as compared to H.264 video.

An Efficient Video Compression System Based on LSK Encoder

The main goal of this research is to develop a robust video compression system that is assessed by several videos and the effectiveness of compression is evaluated by measuring the PSNR values for the compression results.

FPGA Based Architectures for H. 264/AVC Video Compression Standard

  • L. AgostiniS. Bampi
  • Computer Science
    2006 International Conference on Field Programmable Logic and Applications
  • 2006
This work focuses on the design of high performance architectures for the H.264/AVC standard, which has been developed to achieve significant improvements, in the compression performance, over the existing standards.

Spatial Scalability Within the H.264/AVC Scalable Video Coding Extension

The spatially scalable extension within the resulting scalable video coding standard is introduced and the high-level design is described and individual coding tools are explained.

Digital Video Compression

Frame differencing-based segmentation for low bit rate video codec using H.264

In video sequence coding, a combination of temporal and spatial coding technique is used in order to remove the predictable or redundant image content and encode only the unpredictable information.

Compression performance and video quality comparison of HEVC and AVC

  • H. KoumarasM. KourtisC. Skianis
  • Computer Science
    2014 IEEE 19th International Workshop on Computer Aided Modeling and Design of Communication Links and Networks (CAMAD)
  • 2014
It is deduced that the HEVC meets its primary objective, which is to double the compression efficiency of the bit stream without significant degradation of the encoded video quality.

Learning for Video Compression

The proposed PixelMotionCNN (PMCNN) which includes motion extension and hybrid prediction networks can model spatiotemporal coherence to effectively perform predictive coding inside the learning network and provides a possible new direction to further improve compression efficiency and functionalities of future video coding.

Implementation of Hybrid MC/DCT video codec

The Hybrid MC/DCT video codec standard is examined and a video compression algorithm based on the present standard is implemented, to provide the least compromise between the computational speed and compression ratio.


This paper provides an overview of the H.264 features and summarizes the emerging studies related to new coding features of the standard and provides new possibilities for creating better video encoders and decoders that provide higher quality video streams at maintained bit-rates (compared to previous standards).



Overview of the H.264/AVC video coding standard

An overview of H.264/MPEG-4 AVC is provided and some comparisons with the existing standards, MPEG-2 and MPEG-4 Part 2, are also provided.

Rate-distortion optimization for video compression

The rate-distortion efficiency of video compression schemes is based on a sophisticated interaction between various motion representation possibilities, waveform coding of differences, and waveform

Enhanced video compression with standardized bit stream syntax

  • S. WuA. Gersho
  • Computer Science
    1993 IEEE International Conference on Acoustics, Speech, and Signal Processing
  • 1993
The authors formulate the two separate problems of encoding and decoding as two individual problems and examine some possible ways to enhance performance over previous algorithms.

H.264/AVC over IP

  • S. Wenger
  • Computer Science
    IEEE Trans. Circuits Syst. Video Technol.
  • 2003
The paper describes the use of H.264 coded video over best-effort IP networks, using RTP as the real-time transport protocol.

Generalized B pictures and the draft H.264/AVC video-compression standard

It is demonstrated how the quality of the B pictures should be reduced to improve the overall rate-distortion performance of the scalable representation and shown that the gains by multihypothesis prediction and arithmetic coding are additive.

Rate-constrained coder control and comparison of video coding standards

A unified approach to the coder control of video coding standards such as MPEG-2, H.263, MPEG-4, and the draft video coding standard H.264/AVC (advanced video coding) is presented. The performance of

The SP- and SI-frames design for H.264/AVC

It is shown that SP-frames have significantly better coding efficiency than I-frames while providing similar functionalities.

Interframe Redundancy Reduction of Video Signals Generated by Translating Objects

In interframe coding the content of the frame memory can also be utilized for the prediction of the luminances of moving area picture elements (pels), provided that their displacement has been

Distributed Video Coding

The recent development of practical distributed video coding schemes is reviewed, finding that the rate-distortion performance is superior to conventional intraframe coding, but there is still a gap relative to conventional motion-compensated interframe coding.

A video encoding system with conditional picture-element replenishment

An experimental method for encoding television signals which takes advantage of the frame-to-frame correlation to reduce transmission bit rate is described, demonstrated in real-time using the head-and-shoulder view of a person in animated conversation as the picture source.