Skip to content

Commit

Permalink
Merge pull request #400 from openforcefield/amcisaac-patch-1
Browse files Browse the repository at this point in the history
Update README.md with details about compute expansion
  • Loading branch information
amcisaac authored Nov 4, 2024
2 parents 919f255 + f3cbe78 commit 44efa60
Showing 1 changed file with 16 additions and 2 deletions.
18 changes: 16 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -91,14 +91,28 @@ The programatic description is provided below, with an example of the notebook a

1. Create a new branch as described above, and navigate to the submission directory of the dataset you want to expand.
2. Create a new jupyter notebook called `generate-compute.ipynb` [example here](https://github.com/openforcefield/qca-dataset-submission/blob/master/submissions/2024-09-18-OpenFF-NAGL2-ESP-Timing-Benchmark-v1.1/generate-compute.ipynb).
3. In the notebook, either download the original dataset and remove the molecules and _original_ `QCSpec`, or re-create the dataset with the same metadata as the original (e.g. same name, description, etc) and skip the molecule addition step.
3. In the notebook, either download the original dataset and remove the molecules and _original_ `QCSpec`, or re-create the dataset with the same name as the original and skip the molecule addition step.
* See below for details about how changes to the dataset are propagated; note that the dataset name must be the same, and changes to any metadata except `compute-tag` and the `QCSpec` will be ignored when submitting the compute expansion.
* Please note that the default `compute_tag` is `openff`; if you need to use a different one, please add it explicitly to the dataset at this step, as the `compute.json` file overrides the compute tag added manually to the PR. If you do need to change the compute tag after submission, you can change it by updating the label on the PR and the change will take effect when the error cycling action runs next.
4. Add the _new_ `QCSpec` to the dataset, and save the dataset to `compute.json`, example [here](https://github.com/openforcefield/qca-dataset-submission/blob/add-ddx-to-nagl-benchmark/submissions/2024-09-18-OpenFF-NAGL2-ESP-Timing-Benchmark-v1.1/compute.json).
4. Add the _new_ `QCSpec` to the dataset, and save the dataset to `compute.json`, example [here](https://github.com/openforcefield/qca-dataset-submission/blob/add-ddx-to-nagl-benchmark/submissions/2024-09-18-OpenFF-NAGL2-ESP-Timing-Benchmark-v1.1/compute.json).
5. Add the additional compute spec to the submission's `README.md` file.
6. Add the `generate-compute.ipynb` and `compute.json` files to the submission's `QCSubmit Manifest` entry in the `README.md` file.
7. Proof the submission and open a PR. Dataset validation will run automatically.
8. Once the dataset is validated, request a review, and once approved, your compute expansion will be submitted!

When the PR is merged, the following happens:

* CI checks for `compute*.json*`, so files can be called anything so long as they follow that pattern.

* This gets loaded into a QCSubmit `dataset` structure in CI (see `lifecycle.py`, [`SubmittableBase`](https://github.com/openforcefield/qca-dataset-submission/blob/master/management/lifecycle.py#L333)) and submitted to MolSSI with [`openff.qcsubmit.datasets.datasets._BaseDataset.submit()`](https://github.com/openforcefield/openff-qcsubmit/blob/main/openff/qcsubmit/datasets/datasets.py#L174)

* `submit()` checks if the dataset already exists using only the dataset type and name. Changes in descriptions, other metadata, etc. don't affect anything. New/different molecules will also be ignored if the dataset name already exists.

* `submit()` adds the specifications

* `submit()` submits with the `compute_tag` and `priority` within the new `compute.json`.

* Other info in the dataset, such as `dataset_tags`, are not incorporated into additional compute submissons and thus changing them will not affect the dataset.

# The Lifecycle of a Dataset Submission

Expand Down

0 comments on commit 44efa60

Please sign in to comment.