This is the codebase for the project OpenMU: Your Swiss Army Knife for Music Understanding.
A Dockerfile is provided and should work out of the box.
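As a rough sketch of the intended workflow (the image tag `openmu` and the mount path are illustrative choices, not names fixed by this repo), building and entering the container could look like:

```bash
# Build the image from the provided Dockerfile (tag name is illustrative).
docker build -t openmu .

# Run interactively with GPU access (requires the NVIDIA Container Toolkit);
# mounting the current directory keeps data and checkpoints visible inside.
docker run --gpus all -it -v "$(pwd)":/workspace openmu /bin/bash
```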
For OpenMU-Bench, the benchmark we created for music understanding tasks, please download it from here.
Please also download the checkpoints for inference.
OpenMU contains two training stages:
- Stage 1 training: OpenMU is trained to output captions conditioned on an input music clip;
- Stage 2 training: instruction following, where OpenMU follows instructions in the music domain.
To launch training, please check out and use stage1.sh and stage2.sh, respectively.
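Assuming the scripts take no required positional arguments (their exact flags and environment variables are defined inside the scripts themselves), launching the two stages could be as simple as:

```bash
# Stage 1: caption pretraining on music clips.
bash stage1.sh

# Stage 2: instruction tuning in the music domain.
bash stage2.sh
```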
Please refer to run_inference.sh for running inference with the provided checkpoints.
We use lyrics understanding (model_lyrics_grid.py) as an example in the scripts; replace it with other scripts (e.g., model_musicqacaption.py) for other splits of OpenMU-Bench (e.g., MusicQA captioning).
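As an illustrative sketch of what that swap looks like (the `--checkpoint` flag and path below are hypothetical; consult run_inference.sh for the actual invocation), the change amounts to pointing the launcher at a different model script:

```bash
# Lyrics understanding split (the example used in the repo's scripts).
python model_lyrics_grid.py --checkpoint /path/to/openmu_checkpoint

# MusicQA captioning split: swap in the corresponding script.
python model_musicqacaption.py --checkpoint /path/to/openmu_checkpoint
```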
If you find OpenMU or OpenMU-Bench useful, please cite:

@article{zhao2024openmu,
title={OpenMU: Your Swiss Army Knife for Music Understanding},
author={Zhao, Mengjie and Zhong, Zhi and Mao, Zhuoyuan and Yang, Shiqi and Liao, Wei-Hsiang and Takahashi, Shusuke and Wakaki, Hiromi and Mitsufuji, Yuki},
journal={arXiv preprint arXiv:2410.15573},
year={2024}
}