• Publications
  • Influence
Deep Convolutional Neural Networks for Predominant Instrument Recognition in Polyphonic Music
TLDR
We present a convolutional neural network framework for predominant instrument recognition in real-world polyphonic music. Expand
  • 106
  • 15
  • PDF
Rare Sound Event Detection Using 1D Convolutional Recurrent Neural Networks
TLDR
We apply 1D CRNN which is a combination of 1D convolutional neural network (1D ConvNet) and recurrent neural Network (RNN) with long short-term memory units (LSTM) for rare sound event detection task. Expand
  • 75
  • 11
  • PDF
Phase-aware Speech Enhancement with Deep Complex U-Net
TLDR
We propose Deep Complex U-net, an advanced U-Net structured model incorporating well-defined complexvalued building blocks to deal with complex-valued spectrograms. Expand
  • 50
  • 11
  • PDF
Convolutional neural networks with binaural representations and background subtraction for acoustic scene classification
In this paper, we demonstrate how we applied convolutional neural network for DCASE 2017 task 1, acoustic scene classification. We propose a variety of preprocessing methods that emphasise differentExpand
  • 89
  • 10
  • PDF
Acoustic Chord Transcription and Key Extraction From Audio Using Key-Dependent HMMs Trained on Synthesized Audio
  • K. Lee, M. Slaney
  • Computer Science
  • IEEE Transactions on Audio, Speech, and Language…
  • 1 February 2008
TLDR
We describe an acoustic chord transcription system that uses symbolic data to train hidden Markov models and gives best-of-class frame-level recognition results. Expand
  • 141
  • 9
  • PDF
Automatic Chord Recognition from Audio Using Enhanced Pitch Class Profile
  • K. Lee
  • Computer Science
  • ICMC
  • 2006
TLDR
In this paper, a feature vector called the Enhanced Pitch Class Profile (EPCP) is introduced for automatic chord recognition from the raw audio. Expand
  • 93
  • 7
  • PDF
Ensemble of Convolutional Neural Networks for Weakly-Supervised Sound Event Detection 
 using Multiple Scale Input
In this paper, we propose to use an ensemble of convolutional neural networks to detect audio events in the automotive environment. Each of the networks is based on various lengths of analysisExpand
  • 46
  • 6
  • PDF
Adversarially Trained End-to-end Korean Singing Voice Synthesis System
TLDR
We propose an end-to-end Korean singing voice synthesis system from lyrics and a symbolic melody using the following three novel approaches: 1) phonetic enhancement masking, 2) local conditioning of text and pitch to the super-resolution network, and 3) conditional adversarial training. Expand
  • 26
  • 5
  • PDF
#nowplaying the future billboard: mining music listening behaviors of twitter users for hit song prediction
TLDR
We investigate the relationship between the music listening behaviors of Twitter users and a popular music ranking service by comparing information extracted from tweets with music-related hashtags and the Billboard chart. Expand
  • 24
  • 5
Acoustic scene classification using convolutional neural network and multiple-width frequency-delta data augmentation
TLDR
We propose a multiple-width frequency-delta (MWFD) data augmentation method for acoustic scene classification, and show that the error rate can be further decreased by using delta features in the frequency domain. Expand
  • 39
  • 4
  • PDF
...
1
2
3
4
5
...