Corpus ID: 8331534

Segmentation-Based Lyrics-Audio Alignment using Dynamic Programming

@inproceedings{Lee2008SegmentationBasedLA,
  title={Segmentation-Based Lyrics-Audio Alignment using Dynamic Programming},
  author={Kyogu Lee and M. Cremer},
  booktitle={ISMIR},
  year={2008}
}
In this paper, we present a system for automatic alignment of textual lyrics with musical audio. Given an input audio signal, structural segmentation is first performed and similar segments are assigned a label by computing the distance between the segment pairs. Using the results of segmentation and hand-labeled paragraphs in lyrics as a pair of input strings, we apply a dynamic programming (DP) algorithm to find the best alignment path between the two strings, achieving segment-to-paragraph… Expand

Figures, Tables, and Topics from this paper

Lyrics-to-Audio Alignment and its Application
TLDR
An overview of recent development in lyrics-to-audio alignment techniques is provided, where a particular focus on categorization of various methods and on applications is put on. Expand
Word level lyrics-audio synchronization using separated vocals
  • S. Lee, Jeffrey J. Scott
  • Computer Science
  • 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  • 2017
TLDR
This paper presents an approach for lyric-audio alignment by comparing synthesized speech with a vocal track removed from an instrument mixture using source separation, taking a hierarchical approach to solve the problem. Expand
LYRICS-AUDIO SYNCHRONIZATION USING SEPARATED VOCALS
The massive amount of digital music data available necessitates automated methods for processing, classifying and organizing large volumes of songs. As music discovery and interactive musicExpand
Automatic Lyrics-to-audio Alignment on Polyphonic Music Using Singing-adapted Acoustic Models
TLDR
It is demonstrated that the use of audio source separation method and effective end-pointing of the songs has a high impact on the alignment performance through the experiments, and is comparable with the state-of-the-art lyrics-to-audio alignment system that is trained on a large polyphonic music database. Expand
Leveraging repetition for improved automatic lyric transcription in popular music
TLDR
This paper investigates how lyrics from musical audio can be leveraged to form a consensus transcription with improved consistency and accuracy, and shows that improvements can be gained using a variety of techniques. Expand
A Strategy for Improved Phone-Level Lyrics-to-Audio Alignment for Speech-to-Singing Synthesis
TLDR
A complete pipeline for automatic phone-level lyrics-to-audio alignment based on an HMM-based forced-aligner and singing acoustics normalization is proposed and the smoothness of the singing voice generated with the proposed methodology was found close to the one obtained using manual alignments. Expand
Lyrics-to-Audio Alignment by Unsupervised Discovery of Repetitive Patterns in Vowel Acoustics
TLDR
Experiments with Korean and English data sets showed that deploying this method after a pre-developed, unsupervised, singing source separation achieved more promising results than the other state-of-the-art unsuper supervised approaches and an existing ASR-based system. Expand
Automatic Beat Alignment of Rap Lyrics
Rap is characterized by highly rhythmic delivery of words; the subtle ways in which syllables in the lyrics fit into the beats in the music lead to expressivity in rap music. Unfortunately, given aExpand
A Semantics-Driven Approach to Lyrics Segmentation
TLDR
A semantics-driven approach to the automatic segmentation of song lyrics by taking into account the basic formatting commonly in use for lyrics on CD booklets and specialized Web sites in order to extract basic semantic information, such as the organization in lines and sections. Expand
MTSSM - A Framework for Multi-Track Segmentation of Symbolic Music
TLDR
The authors of this paper present the MTSSM framework, a twolayer framework for the multi-track segmentation of symbolic music, a combination of existing methods for local track segmentation and the application of global structure information spanning via multiple tracks. Expand
...
1
2
3
...

References

SHOWING 1-10 OF 11 REFERENCES
LyricAlly: automatic synchronization of acoustic musical signals and textual lyrics
TLDR
A prototype that automatically aligns acoustic musical signals with their corresponding textual lyrics, in a manner similar to manually-aligned karaoke, is presented, using a multimodal approach. Expand
LyricAlly: Automatic Synchronization of Textual Lyrics to Acoustic Music Signals
TLDR
LyricAlly is presented, a prototype that automatically aligns acoustic musical signals with their corresponding textual lyrics, in a manner similar to manually-aligned karaoke, using an appropriate pairing of audio and text processing. Expand
New methods in structural segmentation of musical audio
  • M. Levy, M. Sandler
  • Computer Science
  • 2006 14th European Signal Processing Conference
  • 2006
TLDR
A semi-supervised segmentation process which finds musical structure with improved accuracy given some very limited manual input is introduced. Expand
Automatic audio segmentation using a measure of audio novelty
  • J. Foote
  • Computer Science
  • 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532)
  • 2000
TLDR
This method can find individual note boundaries or even natural segment boundaries such as verse/chorus or speech/music transitions, even in the absence of cues such as silence, by analyzing local self-similarity. Expand
Automatic Music Summarization via Similarity Analysis
TLDR
This work presents methods for automatically producing summary excerpts or thumbnails of music, and demonstrates that the method finds significantly representative excerpts, using very few assumptions about the source audio. Expand
Segmentation of Musical Signals Using Hidden Markov Models.
In this paper, we present a segmentation algorithm for acoustic musical signals, using a hidden Markov model. Through unsupervised learning, we discover regions in the music that present steadyExpand
Popular song and lyrics synchronization and its application to music information retrieval
TLDR
This is the first automatic synchronization system only based on the low-level acoustic feature such as MFCC and it is evaluated on a Chinese song dataset collecting from 3 popular singers to open up the discussion of some challenging problems when developing a robust synchronization system for largescale database. Expand
Automatic Synchronization between Lyrics and Music CD Recordings Based on Viterbi Alignment of Segregated Vocal Signals
TLDR
A system that can automatically synchronize between polyphonic musical audio signals and corresponding lyrics and a method for adapting a speech-recognizer phone model to segregated vocal signals is described. Expand
Visualizing music and audio using self-similarity
  • J. Foote
  • Computer Science
  • MULTIMEDIA '99
  • 1999
TLDR
The acoustic similarity between any two instants of an audio recording is displayed in a 2D representation, allowing identification of structural and rhythmic characteristics, as well as tempo and structure extraction. Expand
Summarizing popular music via structural similarity analysis
  • M. Cooper, J. Foote
  • Computer Science
  • 2003 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (IEEE Cat. No.03TH8684)
  • 2003
TLDR
A framework for summarizing digital media based on structural analysis on characterizing the repetitive structure in popular music by combining segments representing the clusters most frequently repeated throughout the piece is presented. Expand
...
1
2
...