Determining the emotion that best characterizes the affective content of a song is challenging, both because reliable ground truth data are difficult to collect and because of the semantic gap between human perception and the music signal. To address this issue, we represent an emotion as a point in the Cartesian space with valence and…
Fig. 2. Distributions of the ground truth (squares) and the recognition result (filled circles) of the regression method proposed in . Because the emotion values are difficult to compute accurately, the range of the emotion estimates is much smaller than that of the ground truth. The ranking-based approach is free of this problem: songs with the topmost/lowermost rankings are assigned the highest/lowest emotion values, so the estimates cover the full 2-D emotion space.
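
The rank-to-value mapping described in the caption can be illustrated with a minimal sketch. The linear spacing, the [-1, 1] value range, the function name `ranks_to_values`, and the song identifiers are all assumptions made for illustration; the source states only that the topmost and lowermost rankings receive the highest and lowest emotion values.

```python
def ranks_to_values(ranked_songs, low=-1.0, high=1.0):
    """Map songs ordered from lowermost to topmost rank onto [low, high].

    Assumption: values are spaced linearly between the two extremes.
    The source only fixes the endpoints (lowest and highest values).
    """
    n = len(ranked_songs)
    if n == 0:
        return {}
    if n == 1:
        # A single song has no relative position; place it mid-range.
        return {ranked_songs[0]: (low + high) / 2.0}
    step = (high - low) / (n - 1)
    return {song: low + i * step for i, song in enumerate(ranked_songs)}


# Hypothetical ranking along one emotion dimension (e.g. valence),
# ordered from lowermost to topmost.
valence = ranks_to_values(["song_a", "song_b", "song_c", "song_d", "song_e"])
for song, value in valence.items():
    print(song, value)
```

Because the endpoints are fixed by rank rather than by a predicted magnitude, the assigned values always span the whole range, whatever the spread of the underlying estimates. Applying the same mapping to each of the two dimensions would give the full coverage of the 2-D space that the caption describes.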