Understanding Optical Music Recognition

@article{CalvoZaragoza2020UnderstandingOM,
  title={Understanding Optical Music Recognition},
  author={Jorge Calvo-Zaragoza and Jan Hajic and Alexander Pacha},
  journal={ACM Computing Surveys (CSUR)},
  year={2020},
  volume={53},
  pages={1 - 35}
}
For over 50 years, researchers have been trying to teach computers to read music notation, referred to as Optical Music Recognition (OMR). However, this field is still difficult to access for new researchers, especially those without a significant musical background: Few introductory materials are available, and, furthermore, the field has struggled with defining itself and building a shared terminology. In this work, we address these shortcomings by (1) providing a robust definition of OMR and… 
Optical Music Recognition: State of the Art and Major Challenges
TLDR
This paper reviews relevant works in OMR, including fundamental methods and significant outcomes, and highlights different stages of the OMR pipeline, which often lack standard input and output representation and standardised evaluation.
Exploiting the Two-Dimensional Nature of Agnostic Music Notation for Neural Optical Music Recognition
TLDR
This work considers two Convolutional Recurrent Neural Network schemes trained to simultaneously extract the shape and height information and to propose different policies for eventually merging them at the actual neural level to solve the OMR task at a staff-line level.
A Neural Approach for Full-Page Optical Music Recognition of Mensural Documents
TLDR
A full-page OMR system for Mensural notation scores that consists of simply two processes, which are enough to extract the symbolic music information from a full page and uses Selectional Auto-Encoders to extract single staff regions, combined with end-to-end stafflevel recognition based on Convolutional Recurrent Neural Networks for retrieving the music notation.
DoReMi: First glance at a universal OMR dataset
TLDR
This paper provides a first look at DoReMi, an OMR dataset that addresses these challenges, and a baseline object detection model to assess its utility, and opens a new course of discussions that would ultimately aid that goal.
Region-based Layout Analysis of Music Score Images
TLDR
The results show that the choice of the model and its performance are crucial for the entire transcription process, and the proposed data-generation technique enables state-of-the-art results to be achieved with a limited set of labeled data.
An Empirical Evaluation of End-to-End Polyphonic Optical Music Recognition
TLDR
This work proposes two novel formulations for end-to-end polyphonic OMR—one treating the problem as a type of multi-task binary classification, and the other treating it as multi-sequence detection.
On the Integration of Language Models into Sequence to Sequence Architectures for Handwritten Music Recognition
TLDR
After studying various Language Model integration techniques, the experimental evaluation on historical handwritten music scores shows a significant improvement over the state of the art, showing that this is a promising research direction for dealing with such difficult manuscripts.
Multimodal image and audio music transcription
Optical Music Recognition (OMR) and Automatic Music Transcription (AMT) stand for the research fields that aim at obtaining a structured digital representation from sheet music images and acoustic
MULTIMODAL AUDIO AND IMAGE MUSIC TRANSCRIPTION
Optical Music Recognition (OMR) and Automatic Music Transcription (AMT) stand for the research fields which aim at obtaining a structured digital representation of the music content present in either
Applying Automatic Translation for Optical Music Recognition’s Encoding Step
TLDR
This paper proposes and evaluates the feasibility of using machine translation techniques, using statistical approaches and neural systems, to automatically convert the results of graphical encoding recognition into a standard semantic format, which can be exported as a digital score.
...
1
2
3
4
...

References

SHOWING 1-10 OF 428 REFERENCES
A Baseline for General Music Object Detection with Deep Learning
TLDR
A baseline for general detection of musical symbols with deep learning is presented and the first time that competing music object detection systems from the machine learning paradigm are directly compared to each other.
How current optical music recognition systems are becoming useful for digital libraries
TLDR
This work takes an OMR system based on the traditional pipeline and an end-to-end system, which represent the current state of the art, and illustrates in proof-of-concept experiments their applicability in retrieval settings, and indicates that in some settings, current OMR can be used as a general tool for enriching digital libraries.
Towards a Universal Music Symbol Classifier
TLDR
This paper presents the approach towards unifying multiple datasets into the largest currently available body of over 90000 musical symbols that belong to 79 classes, containing both handwritten and printed music symbols.
Towards Self-Learning Optical Music Recognition
TLDR
A new approach to solve OMR is proposed by investigating how humans read music scores and by imitating that behavior with machine learning, which can be performed at high rates of confidence which is comparable to the performance of humans on the same task.
Camera-PrIMuS: Neural End-to-End Optical Music Recognition on Realistic Monophonic Scores
TLDR
This work evaluates the performance of an end-to-end approach that uses a deep convolutional recurrent neural network (CRNN) over non-ideal image conditions of music scores and confirms that the CRNN is able to successfully solve the task under these conditions, thereby representing a groundbreaking piece of research towards useful OMR systems.
End-to-End Neural Optical Music Recognition of Monophonic Scores
TLDR
This work studies the use of neural networks that work in an end-to-end manner by using a neural model that combines the capabilities of convolutional neural Networks, which work on the input image, and recurrent neural networks, which deal with the sequential nature of the problem.
A Deep Approach for Handwritten Musical Symbols Recognition
TLDR
This paper presents a new methodology applying deep learning techniques to recognise musical notes in digitalised handwritten musical scores and test it on a ground truth dataset of music scores.
In Search of a Dataset for Handwritten Optical Music Recognition: Introducing MUSCIMA++
TLDR
The MUSCIMA++ dataset of handwritten music notation is designed that addresses musical symbol recognition and notation reconstruction and allows training and evaluating models for symbol classification, symbol localization, and notation graph assembly, both in isolation and jointly.
A HYBRID HMM-RNN MODEL FOR OPTICAL MUSIC RECOGNITION
Optical music recognition (OMR) serves as one of the key technologies in Music Information Retrieval by mining symbolic knowledge directly from images of scores. A fullfledged OMR system encompasses
Optical Music Recognition with Convolutional Sequence-to-Sequence Models
TLDR
A deep learning architecture called a Convolutional Sequence-to-Sequence model is presented to both move towards an end- to-end trainable OMR pipeline, and apply a learning process that trains on full sentences of sheet music instead of individually labeled symbols.
...
1
2
3
4
5
...