Audio-Based Semantic Concept Classification for Consumer Video

@article{Lee2010AudioBasedSC,
  title={Audio-Based Semantic Concept Classification for Consumer Video},
  author={Keansub Lee and Daniel P. W. Ellis},
  journal={IEEE Transactions on Audio, Speech, and Language Processing},
  year={2010},
  volume={18},
  pages={1406-1416}
}
This paper presents a novel method for automatically classifying consumer video clips based on their soundtracks. We use a set of 25 overlapping semantic classes, chosen for their usefulness to users, viability of automatic detection and of annotator labeling, and sufficiency of representation in available video collections. A set of 1873 videos from real users has been annotated with these concepts. Starting with a basic representation of each video clip as a sequence of mel-frequency cepstral… CONTINUE READING

Citations

Publications citing this paper.
SHOWING 1-10 OF 78 CITATIONS

Event-based Video Retrieval Using Audio

  • INTERSPEECH
  • 2012
VIEW 4 EXCERPTS
CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

Entropy-based pruning method for convolutional neural networks

  • The Journal of Supercomputing
  • 2018
VIEW 2 EXCERPTS
CITES METHODS
HIGHLY INFLUENCED

AENet: Learning Deep Audio Features for Video Analysis

  • IEEE Transactions on Multimedia
  • 2017
VIEW 5 EXCERPTS
CITES BACKGROUND & METHODS
HIGHLY INFLUENCED

Smart Ambient Sound Analysis via Structured Statistical Modeling

VIEW 8 EXCERPTS
CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

Speech/music discrimination in a large database of radio broadcasts from the wild

  • 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  • 2014
VIEW 4 EXCERPTS
HIGHLY INFLUENCED

FILTER CITATIONS BY YEAR

2009
2019

CITATION STATISTICS

  • 10 Highly Influenced Citations

References

Publications referenced by this paper.
SHOWING 1-10 OF 25 REFERENCES

Variational Bhattacharyya divergence for hidden Markov models

  • 2008 IEEE International Conference on Acoustics, Speech and Signal Processing
  • 2008
VIEW 6 EXCERPTS
HIGHLY INFLUENTIAL

Detecting music in ambient audio by long-window autocorrelation

  • 2008 IEEE International Conference on Acoustics, Speech and Signal Processing
  • 2008
VIEW 1 EXCERPT

Kodak consumer video benchmark data set: Concept definition and annotation

S.-F. Chang, D. Ellis, +4 authors J. Luo
  • Proc. MIR Workshop, ACM Multimedia, Germany, Sep. 2007.
  • 2007
VIEW 2 EXCERPTS

PLSA on Large Scale Image Databases

  • 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07
  • 2007
VIEW 1 EXCERPT

Audio-based context recognition

  • IEEE Transactions on Audio, Speech, and Language Processing
  • 2006
VIEW 2 EXCERPTS

Classifying user environment for mobile applications using linear autoencoding of ambient audio

  • Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005.
  • 2005
VIEW 1 EXCERPT

Similar Papers

Loading similar papers…