Timbre analysis of music audio signals with convolutional neural networks

@article{Pons2017TimbreAO,
  title={Timbre analysis of music audio signals with convolutional neural networks},
  author={Jordi Pons and Olga Slizovskaia and Rong Gong and Emilia G{\'o}mez and Xavier Serra},
  journal={2017 25th European Signal Processing Conference (EUSIPCO)},
  year={2017},
  pages={2744-2748}
}
The focus of this work is to study how to efficiently tailor Convolutional Neural Networks (CNNs) towards learning timbre representations from log-mel magnitude spectrograms. We first review the trends when designing CNN architectures. Through this literature overview we discuss which are the crucial points to consider for efficiently learning timbre representations using CNNs. From this discussion we propose a design strategy meant to capture the relevant time-frequency contexts for learning… CONTINUE READING

Figures, Tables, and Topics from this paper.

Explore Further: Topics Discussed in This Paper

Citations

Publications citing this paper.
SHOWING 1-10 OF 34 CITATIONS

End-to-end Learning for Music Audio Tagging at Scale

  • ISMIR
  • 2017
VIEW 10 EXCERPTS
CITES METHODS, RESULTS & BACKGROUND

A Feature Learning Siamese Model for Intelligent Control of the Dynamic Range Compressor

  • 2019 International Joint Conference on Neural Networks (IJCNN)
  • 2019
VIEW 7 EXCERPTS
CITES BACKGROUND
HIGHLY INFLUENCED

Accent Detection Within the Amateur Singing Voice

Sarah Ciresi, Vidya Rangasayee
  • 2019
VIEW 4 EXCERPTS
CITES METHODS & BACKGROUND
HIGHLY INFLUENCED

Randomly Weighted CNNs for (Music) Audio Classification

  • ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  • 2018
VIEW 8 EXCERPTS
CITES BACKGROUND

Training Neural Audio Classifiers with Few Data

  • ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  • 2018
VIEW 7 EXCERPTS
CITES BACKGROUND & METHODS

References

Publications referenced by this paper.
SHOWING 1-10 OF 21 REFERENCES

Designing efficient architectures for modeling temporal features with convolutional neural networks

  • 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  • 2017
VIEW 18 EXCERPTS

Deep Convolutional Neural Networks for Predominant Instrument Recognition in Polyphonic Music

  • IEEE/ACM Transactions on Audio, Speech, and Language Processing
  • 2017
VIEW 5 EXCERPTS
HIGHLY INFLUENTIAL

End-to-end learning for music audio

  • 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
  • 2014
VIEW 5 EXCERPTS
HIGHLY INFLUENTIAL

Experimenting with musically motivated convolutional neural networks

  • 2016 14th International Workshop on Content-Based Multimedia Indexing (CBMI)
  • 2016
VIEW 1 EXCERPT