• Publications
  • Influence
Content-Based Music Information Retrieval: Current Directions and Future Challenges
The problems of content-based music information retrieval are outlined and the state-of-the-art methods using audio cues and other cues are explored, and some of the major challenges for the coming years are identified.
Separation of Mixed Audio Sources By Independent Subspace Analysis
We propose the method of independent subspace analysis (ISA) for separating individual audio sources from a single-channel mixture. ISA is based on independent component analysis (ICA) but relaxes
Analysis of Minimum Distances in High-Dimensional Musical Spaces
An automatic method for measuring content-based music similarity, enhancing the current generation of music search engines and recommended systems and compatible with locality-sensitive hashing-allowing implementation with retrieval times several orders of magnitude faster than those using exhaustive distance computations.
Locality-Sensitive Hashing for Finding Nearest Neighbors
This lecture note describes a technique known as locality-sensitive hashing (LSH) that allows one to quickly find similar entries in large databases using a novel and interesting class of algorithms known as randomized algorithms.
The Importance of Sequences in Musical Similarity
  • M. Casey, M. Slaney
  • Computer Science
    IEEE International Conference on Acoustics Speech…
  • 14 May 2006
A significant improvement in performance for audio similarity measures using temporal sequences of features is demonstrated, and it is shown that quantizing the features to string-based representations also performs well, thus admitting efficient implementations based on string matching.
Musical Audio Synthesis Using Autoencoding Neural Nets
An interactive musi- cal audio synthesis system that uses feedforward artificial neural networks for musical audio synthesis, rather than discriminative or regression tasks, and allows one to interact directly with the parameters of the model and generate musical audio in real time.
General sound classification and similarity in MPEG-7
  • M. Casey
  • Computer Science
    Organised Sound
  • 1 August 2001
A system for generalised sound classification and similarity using a machine-learning framework that has been incorporated into the MPEG-7 international standard for multimedia content description and is therefore publicly available in the form of standardised interfaces and software reference tools for developers and researchers.
Auditory group theory with applications to statistical basis methods for structured audio
Thesis (Ph. D.)--Massachusetts Institute of Technology, Program in Media Arts & Sciences, 1998.
Song Intersection by Approximate Nearest Neighbor Search
To scale the search to large song databases, an algorithm based on localitysensitive hashing (LSH) of sequences of audio features called audio shingles provides an efficient means to identify approximate nearest neighbors in a high-dimensional feature space.
Diamond Park and Spline: A Social Virtual Reality System with 3D Animation
Diamond Park is a social virtual reality system in which multiple geographically separated users can speak to each other and participate in joint activities. The central theme of the park is cycling.