YouTube UGC Dataset for Video Compression Research

  title={YouTube UGC Dataset for Video Compression Research},
  author={Yilin Wang and Sasi Inguva and Balu Adsumilli},
  journal={2019 IEEE 21st International Workshop on Multimedia Signal Processing (MMSP)},
Non-professional video, commonly known as User Generated Content (UGC) has become very popular in today's video sharing applications. [] Key Method Besides a novel sampling method based on features extracted from encoding, challenges for UGC compression and quality evaluation are also discussed. Shortcomings of traditional reference-based metrics on UGC are addressed. We demonstrate a promising way to evaluate UGC quality by no-reference objective quality metrics, and evaluate the current dataset with three…

Figures and Tables from this paper

UGC-VQA: Benchmarking Blind Video Quality Assessment for User Generated Content
This work conducts a comprehensive evaluation of leading no-reference/blind VQA (BVQA) features and models on a fixed evaluation architecture, yielding new empirical insights on both subjective video quality studies and objective V QA model design.
Rich features for perceptual quality assessment of UGC videos
A large scale dataset is created to comprehensively investigate characteristics of generic UGC video quality and a DNN-based framework is proposed to thoroughly analyze importance of content, technical quality, and compression level in perceptual quality.
Predicting the Quality of Compressed Videos With Pre-Existing Distortions
1stepVQA overcomes limitations of Full-Reference, Reduced-Reference and No-Reference VQA models by exploiting the statistical regularities of both natural videos and distorted videos, and is able to more accurately predict the quality of compressed videos, given imperfect reference videos.
Subjective and objective study of sharpness enhanced UGC video quality
A statistical model is proposed to precisely measure whether the quality of UGC videos is enhanced or degraded and benchmark state-of-the-art No-Reference image or video quality metrics with the collected subjective data.
On quality control of user-generated-content (UGC) compression
A novel perceptual model on compressed UGC video quality is proposed by exploiting characteristics extracted from only source video and can accurately model quality curves for various UGC videos and control compression quality well.
Subjective Quality Assessment For Youtube Ugc Dataset
This work analyzed the distribution of Mean Opinion Score (MOS) in various dimensions, and investigated some fundamental questions in video quality assessment, like the correlation between full video MOS and corresponding chunk MOS, and the influence of chunk variation in quality score aggregation.
PUGCQ: A Large Scale Dataset for Quality Assessment of Professional User-Generated Content
A large domain gap between the PUGC and the traditional user-generated videos is revealed and shed light on developing next-generation PUGC quality assessment algorithms with desired properties including promising generalization capability, high accuracy, and effectiveness in perceptual optimization.
BVI-CC: A Dataset for Research on Video Compression and Quality Assessment
A dataset, the BVI-CC, which can be used for coding performance evaluation and video quality metric development, and nine source video sequences were carefully selected to offer both diversity and representativeness in the spatio-temporal domain.
Subjective Quality Assessment of User-Generated Content Gaming Videos
Benefited from the rapid development of the digital game industry, the growing popularity of online user-generated content (UGC) videos for games has accelerated the development of perceptual video
RAPIQUE: Rapid and Accurate Video Quality Prediction of User Generated Content
RAPIQUE combines and leverages the advantages of both quality-aware scene statistics features and semantics-aware deep convolutional features, allowing it to design the first general and efficient spatial and temporal (space-time) bandpass statistics model for video quality modeling.


A no-reference video quality predictor for compression and scaling artifacts
The proposed quality predictor called Self-reference based LEarning-free Evaluator of Quality (SLEEQ) consists of three components: feature extraction in the spatial and temporal domains, motion-based feature fusion, and spatial-temporal feature pooling to derive a single quality score for a given video.
The Konstanz natural video database (KoNViD-1k)
  • Vlad Hosu, F. Hahn, D. Saupe
  • Computer Science
    2017 Ninth International Conference on Quality of Multimedia Experience (QoMEX)
  • 2017
KoNViD-1k is reported on, a subjectively annotated VQA database consisting of 1,200 public-domain video sequences, fairly sampled from a large public video dataset, YFCC100m, aimed at ‘in the wild’ authentic distortions.
YouTube-8M: A Large-Scale Video Classification Benchmark
YouTube-8M is introduced, the largest multi-label video classification dataset, composed of ~8 million videos (500K hours of video), annotated with a vocabulary of 4800 visual entities, and various (modest) classification models are trained on the dataset.
Large Scale Subjective Video Quality Study
  • Zeina Sinno, A. Bovik
  • Computer Science
    2018 25th IEEE International Conference on Image Processing (ICIP)
  • 2018
This work carefully designed a framework in Amazon Mechanical Turk (AMT) to address the many technical issues that are faced and has verified that the framework provided results that are highly consistent with the ones obtained in a lab environment under controlled conditions.
Study of Subjective and Objective Quality Assessment of Video
A recent large-scale subjective study of video quality on a collection of videos distorted by a variety of application-relevant processes results in a diverse independent public database of distorted videos and subjective scores that is freely available.
Optimized Transcoding for Large Scale Adaptive Streaming Using Playback Statistics
Measurements on the actual usage of millions of video clips are used to create probability distributions of available bandwidth and viewport sizes and the resulting scheme demonstrates an overall bandwidth reduction of 9.7% compared with existing techniques without loss of delivered quality.
A No-reference Perceptual Quality Metric for Videos Distorted by Spatially Correlated noise
This paper builds a video database that simulates the commonly encountered noise characteristics of videos, and develops a perceptual quality assessment algorithm that explicitly incorporates the noise correlations.
Analysis of Public Image and Video Databases for Quality Assessment
  • Stefan Winkler
  • Computer Science
    IEEE Journal of Selected Topics in Signal Processing
  • 2012
Several criteria for quantitative comparisons of source content, test conditions, and subjective ratings are proposed, which will allow researchers to make more well-informed decisions about databases and may also guide the creation of additional test material and the design of future experiments.
A perceptual visibility metric for banding artifacts
A novel metric that incorporates both edge length and contrast across the edge to measure video banding is proposed that outperforms PSNR, SSIM, and VQM and also introduces both reference and non-reference metrics.
Image quality assessment: from error visibility to structural similarity
A structural similarity index is developed and its promise is demonstrated through a set of intuitive examples, as well as comparison to both subjective ratings and state-of-the-art objective methods on a database of images compressed with JPEG and JPEG2000.