This repository is a community effort to collect meta-information about all DCASE-related datasets in one place. The tables on the DCASE Datalist website are generated from the metadata in this repository.
This repository focuses specifically on pre-packaged datasets rather than online data repositories. A dataset should be well documented, packaged for easy usage, and have a free or open license for academic research.
- Fork this repository
- Identify a category for the dataset: scenes, events, captions
- Copy the appropriate dataset template for the category (e.g. `datasets\scenes_template.yaml`) from the root of `datasets\` to `datasets\scenes`, `datasets\sounds`, or `datasets\captions`, and rename the template accordingly
- Update the information in the YAML file. Delete fields that are not needed. If you would like to have fields that do not yet exist, raise an issue on GitHub.
- Make a pull request from your fork to the main repository.
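
The template-copying step above can also be scripted. The following is only a minimal sketch, not part of this repository: the helper `copy_template` and the file name `my_dataset.yaml` are hypothetical, and the folder names are taken from the steps above.

```python
import shutil
from pathlib import Path

# Category folders named in the steps above.
CATEGORY_DIRS = ("scenes", "sounds", "captions")


def copy_template(datasets_root, template_name, category_dir, new_name):
    """Copy a template from the root of the datasets folder into a category folder.

    Hypothetical helper: all arguments are illustrative and should be
    replaced with values that match your dataset.
    """
    if category_dir not in CATEGORY_DIRS:
        raise ValueError(f"unknown category folder: {category_dir!r}")
    root = Path(datasets_root)
    source = root / template_name
    target = root / category_dir / new_name
    # Refuse to overwrite an existing dataset description.
    if target.exists():
        raise FileExistsError(f"{target} already exists")
    target.parent.mkdir(parents=True, exist_ok=True)
    shutil.copyfile(source, target)
    return target
```

For example, `copy_template("datasets", "scenes_template.yaml", "scenes", "my_dataset.yaml")` would create `datasets/scenes/my_dataset.yaml`, which you can then edit by hand.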
Optionally, you can raise an issue in this repository.
- Run `update.py` to update the `.json` files
- Start the local server with `start_local_server.py`
- Open the URL http://localhost:8000/
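
The three steps above can be chained in a small wrapper. This is a sketch under assumptions rather than something shipped with the repository: it assumes both scripts are run from the repository root with the current Python interpreter and take no arguments, and the names `preview_commands` and `run_preview` are hypothetical.

```python
import subprocess
import sys
import webbrowser

# URL given in the steps above.
LOCAL_URL = "http://localhost:8000/"


def preview_commands(python=sys.executable):
    """Return the two script invocations from the steps above, in order."""
    return [
        [python, "update.py"],
        [python, "start_local_server.py"],
    ]


def run_preview():
    """Regenerate the .json files, start the local server, and open the page."""
    update_cmd, server_cmd = preview_commands()
    # Stop immediately if update.py exits with an error.
    subprocess.run(update_cmd, check=True)
    # Assumption: the server script keeps running until it is interrupted.
    server = subprocess.Popen(server_cmd)
    try:
        # The page may need a refresh if the server is still starting up.
        webbrowser.open(LOCAL_URL)
        server.wait()
    except KeyboardInterrupt:
        server.terminate()
```

Calling `run_preview()` from the repository root runs the whole sequence; press Ctrl+C to stop the server.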