Tone Identification and Classification (TonIC)

TonIC is an algorithm for converting raw object detections of grand staff, measures and noteheads into structured MusicXML representations. It is modular and individual components can be independently modified or improved for experimental purposes.

Pipeline visualization

The input image is passed through two object detection pipelines. First, it is run through a BW filter and the OLA model responsible for measure and grand staff detections. Measure detections are then refined using the StaLiX library. Second, the input image is divided into smaller tiles, which are individually processed by the NotA model for notehead detection; the results are then stitched back together into a full-page. Finally, the combined detections are given to TonIC that outputs LMX, that is subsequently converted into MusicXML.

Additions to Existing Datasets

MUSCIMA#
- Combination of annotations from MUSCIMA++, sheet music from CVC-MUSCIMA, and 162 distinct empty backgrounds from MZKBlank.
MuNG to OLA
- Algorithm based on heuristics to create OLA objects - system measures, measures, staffs, system, grand staffs - from the original MuNG format.
OLiMPiC Scanned
- Downloads and unpacks the OLiMPiC Scanned dataset.

After cloning

Setup Python venv (tested with Python 3.11):

python3 -m venv .venv
.venv/bin/pip install --upgrade pip

.venv/bin/pip install -r requirements.txt

Clone Object Detection Tools and StaLiX and install them to venv:

# install StaLiX
cd <path>/stalix
.venv/bin/pip install -e .[viz]

# install od-tools
cd <path>/od-tools
.venv/bin/pip install -e .

Running demo

If run for the first time, the --update argument should be present - latest detection models will be downloaded. Input image or directory can be specified with -i <my-image.png>, output directory can be specified with -o <my-output-dir>. If no images are specified, preselected examples from MZK will be downloaded and passed through the pipeline. For algorithm visualization use --visualize <viz-level>.

# minimal inference run with example images
python3 -m demo --update --visualize 2

Running evaluation

For object detection $F_1$ score refer to the Object Detection Tools project.

Evaluation is run for two MusicXML files, their edit distance in computed for four different formats: Standardized, Reduced, Melody, Contour; see Format Overview for reference.

python3 -m tonic.SERVal <predicted-file-or-dir> <gold-file-or-dir>

During OLiMPiC evaluation, raw images are passed through the pipeline and predictions are compared to loaded ground truth. The user can specify how many images should be processed:

# random 200 samples from OLiMPiC will be processed
python3 -m tonic.SERVal.tonic -c 200

Known limitations

Visualizations are not optimized. Methods responsible for visualization are debug helper functions and should not be used in large scale scenarios.

References

Muscima++, CVC-Muscima

Jan Hajič jr., Pavel Pecina. In Search of a Dataset for Handwritten Optical Music Recognition: Introducing MUSCIMA++. CoRR, arXiv:1703.04824, 2017. https://arxiv.org/abs/1703.04824.

Alicia Fornés, Anjan Dutta, Albert Gordo, Josep Lladós. CVC-MUSCIMA: A Ground-truth of Handwritten Music Score Images for Writer Identification and Staff Removal. International Journal on Document Analysis and Recognition, Volume 15, Issue 3, pp 243-251, 2012. DOI.

LMX, OLiMPiC

Jiří Mayer, Milan Straka, Jan Hajič jr., Pavel Pecina. Practical End-to-End Optical Music Recognition for Pianoform Music. 18th International Conference on Document Analysis and Recognition, ICDAR 2024. Athens, Greece, August 30 - September 4, pp. 55-73, 2024. DOI, GitHub.

Mark Gotham, Robert Haigh, Petr Jonas. The OpenScore Lieder Corpus. Music Encoding Conference. Online, July 19-22, pp. 131-136, 2022. DOI: https://doi.org/10.17613/1my2-dm23, GitHub: https://github.com/OpenScore/Lieder

OMR Layout Analysis (OLA)

Vojtěch Dvořák, Jan jr. Hajič, and Jiří Mayer. Staff Layout Analysis Using the YOLO Platform. In Jorge Calvo-Zaragoza, Alexander Pacha, and Elona Shatri, editors, Proceedings of the 6th International Workshop on Reading Music Systems, pages 18-22, Online, 2024. https://arxiv.org/abs/2411.15741.

Contact

Developed and maintained by Vojtěch Dvořák (dvorak@ufal.mff.cuni.cz) as part of the Prague Music Computing Group lead by Jan Hajič jr. (hajicj@ufal.mff.cuni.cz).

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
datasetup		datasetup
demo		demo
docs		docs
tonic		tonic
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Tone Identification and Classification (TonIC)

Pipeline visualization

Additions to Existing Datasets

After cloning

Running demo

Running evaluation

Known limitations

References

Muscima++, CVC-Muscima

LMX, OLiMPiC

OMR Layout Analysis (OLA)

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Languages

v-dvorak/tonic

Folders and files

Latest commit

History

Repository files navigation

Tone Identification and Classification (TonIC)

Pipeline visualization

Additions to Existing Datasets

After cloning

Running demo

Running evaluation

Known limitations

References

Muscima++, CVC-Muscima

LMX, OLiMPiC

OMR Layout Analysis (OLA)

Contact

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages