Visualizing music and audio using self-similarity

@inproceedings{Foote1999VisualizingMA,
  title={Visualizing music and audio using self-similarity},
  author={Jonathan Foote},
  booktitle={MULTIMEDIA '99},
  year={1999}
}
  • J. Foote
  • Published in MULTIMEDIA '99 1999
  • Computer Science
This paper presents a novel approach to visualizing the time structure of music and audio. The acoustic similarity between any two instants of an audio recording is displayed in a 2D representation, allowing identification of structural and rhythmic characteristics. Examples are presented for classical and popular music. Applications include content-based analysis and segmentation, as well as tempo and structure extraction. 
Multimodal structure segmentation and analysis of music using audio and textual information
TLDR
Since lyrics contain rich information about the semantic structure of a song, the approach incorporates lyrics to overcome the existing difficulties associated with large acoustic variation in music. Expand
The Song Picture - On Musical Information Visualization for Audio and Video Editing
TLDR
The importance of presenting to the user some hints about the melodic content of the audio to improve the visual segmentation of a music piece, thereby speeding up editing tasks is discussed. Expand
Extraction of High-Level Musical Structure From Audio Data and Its Application to Thumbnail Generation
  • M. Levy, M. Sandler, M. Casey
  • Computer Science
  • 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings
  • 2006
TLDR
New evidence is presented to show that music segmentation can be recast as clustering of timbre features, and a new clustering algorithm is described. Expand
Quantitative comparisons into content-based music recognition with the self organising map
TLDR
An ongoing investigation into the subject of musical recognition purely by the sonic content from a standard recording is described. Expand
New methods in structural segmentation of musical audio
  • M. Levy, M. Sandler
  • Computer Science
  • 2006 14th European Signal Processing Conference
  • 2006
TLDR
A semi-supervised segmentation process which finds musical structure with improved accuracy given some very limited manual input is introduced. Expand
Methods for the Automatic Analysis of Music and Audio
TLDR
This method can find individual note boundaries or even natural segment boundaries such as verse/chorus or speech/music transitions, even in the absence of cues such as silence, by analyzing local self-similarity. Expand
Midivis: Visualizing Music Structure via Similarity Matrices
TLDR
The method is automatic and enables visualizing an entire opus in a single image and unveil the structure of a piece as well as detailed themes’ leading within a piece. Expand
Improving Audio Information Visualization by Showing Melodic Content
TLDR
A visual representation is introduced which, while resembling the waveform, takes into account the melodic content of the audio, thus allowing the user to visually segment and find patterns in the media more easily. Expand
Automatic audio segmentation using a measure of audio novelty
  • J. Foote
  • Computer Science
  • 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532)
  • 2000
TLDR
This method can find individual note boundaries or even natural segment boundaries such as verse/chorus or speech/music transitions, even in the absence of cues such as silence, by analyzing local self-similarity. Expand
Adaptive Temporal Modeling of Audio Features in the Context of Music Structure Segmentation
TLDR
Evaluation of the method shows that temporal modeling is consistently adapted to different musical contexts, allowing for robust music structure segmentation while gaining independence regarding parameter tuning. Expand
...
1
2
3
4
5
...

References

SHOWING 1-10 OF 10 REFERENCES
Content-based retrieval of music and audio
  • J. Foote
  • Computer Science, Engineering
  • Other Conferences
  • 1997
TLDR
A system to retrieve audio documents y acoustic similarity based on statistics derived from a supervised vector quantizer, rather than matching simple pitch or spectral characteristics, which may be applicable to image retrieval as well. Expand
A visualization of music
TLDR
The implementation of this method shows that music visualization is an effective technique, although it is certainly not the only possible method for accomplishing the task. Expand
A comparison of features for speech, music discrimination
TLDR
This paper examines the discrimination achieved by several different features using common training and test sets and the same classifier on four types of feature, amplitude, cepstra, pitch and zero-crossings. Expand
Fundamentals of speech recognition
TLDR
This book presents a meta-modelling framework for speech recognition that automates the very labor-intensive and therefore time-heavy and therefore expensive and expensive process of manually modeling speech. Expand
A Comparision of Features for Speech and Music Discrimination
  • Proc. ICASSP ‘99, vol. 1, pp. 149-152, IEEE, Phoenix AZ 1999
  • 1999
Section 0.6.2, http:// www.faqs.orglfaqs/skeptic-faq/l
  • sci.skeptic FAQ
  • 1999
sci.skeptic FAQ
  • Section 0.6.2, http:// www.faqs.org/faqs/skeptic-faq/ 1, 1999 1 The author would appreciate any pointers to more authoritative references.
  • 1999
sci.skeptic FAQ Section 0.6.2, http:// www.faqs.org/faqs/skeptic-faq
  • sci.skeptic FAQ Section 0.6.2, http:// www.faqs.org/faqs/skeptic-faq
  • 1999
The Music Animation Machine
  • http://www.well.com/user/smalin/mam.html