Automatic music transcription: challenges and future directions


Automatic music transcription is considered by many to be a key enabling technology in music signal processing. However, the performance of transcription systems is still significantly below that of a human expert, and accuracies reported in recent years seem to have reached a limit, although the field is still very active. In this paper we analyse limitations of current methods and identify promising directions for future research. Current transcription methods use general purpose models which are unable to capture the rich diversity found in music signals. One way to overcome the limited performance of transcription systems is to tailor algorithms to specific use-cases. Semi-automatic approaches are another way of achieving a more reliable transcription. Also, the wealth of musical scores and corresponding audio data now available are a rich potential source of training data, via forced alignment of audio to scores, but large scale utilisation of such data has yet to be attempted. Other promising approaches include the integration of information from multiple algorithms and different musical aspects.

DOI: 10.1007/s10844-013-0258-3

Extracted Key Phrases

6 Figures and Tables

Citations per Year

118 Citations

Semantic Scholar estimates that this publication has 118 citations based on the available data.

See our FAQ for additional information.

Cite this paper

@article{Benetos2013AutomaticMT, title={Automatic music transcription: challenges and future directions}, author={Emmanouil Benetos and Simon Dixon and Dimitrios Giannoulis and Holger Kirchhoff and Anssi Klapuri}, journal={Journal of Intelligent Information Systems}, year={2013}, volume={41}, pages={407-434} }