Melon Playlist Dataset: A Public Dataset for Audio-Based Playlist Generation and Music Tagging

@article{Ferraro2021MelonPD,
  title={Melon Playlist Dataset: A Public Dataset for Audio-Based Playlist Generation and Music Tagging},
  author={Andr{\'e}s Ferraro and Yuntae Kim and Soohyeon Lee and Biho Kim and Namjun Jo and Semi Lim and Suyon Lim and Jungtaek Jang and Sehwan Kim and Xavier Serra and Dmitry Bogdanov},
  journal={ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
  year={2021},
  pages={536-540}
}
  • Andrés Ferraro, Yuntae Kim, +8 authors D. Bogdanov
  • Published 30 January 2021
  • Computer Science, Engineering
  • ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
One of the main limitations in the field of audio signal processing is the lack of large public datasets with audio representations and high-quality annotations due to restrictions of copyrighted commercial music. We present Melon Playlist Dataset, a public dataset of mel-spectrograms for 649,091 tracks and 148,826 associated playlists annotated by 30,652 different tags. All the data is gathered from Melon, a popular Korean streaming service. The dataset is suitable for music information… 

Figures and Tables from this paper

Enriched Music Representations With Multiple Cross-Modal Contrastive Learning
TLDR
This paper aligns the latent representations obtained from playlists-track interactions, genre metadata, and the tracks’ audio, by maximizing the agreement between these modality representations using a contrastive loss.
Music Playlist Title Generation: A Machine-Translation Approach
TLDR
This work proposes a machine-translation approach to automatically generate a playlist title from a set of music tracks, adapting the sequenceto-sequence framework based on Recurrent Neural Network and Transformer to the music data.

References

SHOWING 1-10 OF 21 REFERENCES
FMA: A Dataset for Music Analysis
TLDR
The Free Music Archive is introduced, an open and easily accessible dataset suitable for evaluating several tasks in MIR, a field concerned with browsing, searching, and organizing large music collections, and some suitable MIR tasks are discussed.
The Million Playlists Songs Dataset: a descriptive study over multiple sources of user-curated playlists
User interest for playlists is increasing as current music streaming services become more and more popular. In order to get sets of songs that best match current musical needs (e.g. size, diversity,
Recsys challenge 2018: automatic music playlist continuation
TLDR
The ACM Recommender Systems Challenge 2018 focused on automatic music playlist continuation, which is a form of the more general task of sequential recommendation, to recommend up to 500 tracks that fit the target characteristics of the original playlist.
An Analysis of Approaches Taken in the ACM RecSys Challenge 2018 for Automatic Music Playlist Continuation
TLDR
The ACM Recommender Systems Challenge 2018 focused on the task of automatic music playlist continuation, which is a form of the more general task of sequential recommendation, to recommend up to 500 tracks that fit the target characteristics of the original playlist.
Deep content-based music recommendation
TLDR
This paper proposes to use a latent factor model for recommendation, and predict the latent factors from music audio when they cannot be obtained from usage data, and shows that recent advances in deep learning translate very well to the music recommendation setting, with deep convolutional neural networks significantly outperforming the traditional approach.
Content-Aware Collaborative Music Recommendation Using Pre-trained Neural Networks
TLDR
A neural network is trained on semantic tagging information as a content model and used as a prior in a collaborative filtering model, which shows comparably better result than the collaborative filtering approaches, in addition to the favorable performance in the cold-start case.
The Effects of Noisy Labels on Deep Convolutional Neural Networks for Music Tagging
TLDR
It is shown that networks can be effective despite relatively large error rates in groundtruth datasets, while conjecturing that label noise can be the cause of varying tag-wise performance differences.
30Music Listening and Playlists Dataset
We introduce the 30Music dataset 1 , a collection of listening and playlists data retrieved from Internet radio stations through Last.fm API. In this paper we describe the creation process, its
The Million Song Dataset
TLDR
The Million Song Dataset, a freely-available collection of audio features and metadata for a million contemporary popular music tracks, is introduced and positive results on year prediction are shown, and the future development of the dataset is discussed.
Current challenges and visions in music recommender systems research
TLDR
This article identifies and shed light on what it believes are the most pressing challenges MRS research is facing, from both academic and industry perspectives and details possible future directions and visions for the further evolution of the field.
...
1
2
3
...