
Simplify run.py πŸƒβ€β™€οΈ #308

Closed
wants to merge 10 commits

Conversation

KristinaUlicna
Collaborator

@KristinaUlicna KristinaUlicna commented Oct 12, 2023

PR contribution summary

What is this PR useful for? Please describe the problem(s) you're trying to address.

  • Allows node feature & edge property computation at runtime by:
    • Checking all graphs in provided train, valid and infer datasets
    • Checking if graph attribute storage is allowed in the config; error otherwise.
    • Computing & storing the features in the source graph if allowed.
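
The runtime check described above might be sketched roughly like this. This is an illustrative stand-in, not the PR's actual implementation: the function name, the `"features"` attribute key, and the placeholder feature values are all assumptions.

```python
# Illustrative sketch only: 'ensure_node_features', the 'features' key and
# the placeholder feature values are assumptions, not the PR's actual code.
def ensure_node_features(
    nodes: dict[int, dict],
    store_graph_attributes_permanently: bool,
) -> dict[int, dict]:
    """Check one graph's nodes for stored features; compute them if allowed."""
    missing = [n for n, attrs in nodes.items() if "features" not in attrs]
    if not missing:
        return nodes  # all features precomputed; nothing to do
    if not store_graph_attributes_permanently:
        raise ValueError(
            f"{len(missing)} node(s) lack features; set "
            "'store_graph_attributes_permanently: True' to compute & store them."
        )
    for n in missing:
        nodes[n]["features"] = [float(n)]  # placeholder feature vector
    return nodes

# The same check would run over every graph in the train, valid & infer splits.
```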

List of proposed changes / linked issues & discussions

What should a reviewer concentrate their feedback on?

  • πŸƒ Scripts to run - replicating the results with run.py
  • πŸ’» Code quality
  • πŸ“ Everything looks OK?

What type of PR is this? (check all applicable)

  • πŸͺ„ Feature
  • πŸ› Bug fix
  • πŸ§‘β€πŸ’» Code refactor / style
  • πŸ”₯ Performance Improvements

Added tests?

  • πŸ‘ yes, re-vitalised the commented tests + now expects test_run.pyto fail

Hints for the reviewer:

At review time, please try a bunch of files which do not have the node features & edge properties in them.
Set the store_graph_attributes_permanently config hyperparameter to False πŸ‘‡

store_graph_attributes_permanently: False

which should throw an error with instructions on what to do. Please follow the instructions until you get to the point where you can successfully train your model πŸ˜ƒ
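
The failure path the reviewer is asked to exercise could be reproduced with a snippet along these lines. `run_setup` and the message wording are hypothetical stand-ins; only the control flow (missing attributes + storing disabled → error with instructions) mirrors the behaviour described above.

```python
# Hypothetical stand-in for the PR's setup path; the function name and the
# error message wording are assumptions, not the repository's actual code.
def run_setup(graph_has_attributes: bool,
              store_graph_attributes_permanently: bool) -> str:
    if not graph_has_attributes and not store_graph_attributes_permanently:
        raise ValueError(
            "Node features & edge properties not found in graph. Set "
            "'store_graph_attributes_permanently: True' and re-run to "
            "compute & store them before training."
        )
    return "ready to train"

try:
    run_setup(graph_has_attributes=False,
              store_graph_attributes_permanently=False)
except ValueError as err:
    print(err)  # surfaces the instructions the reviewer should follow
```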


PR review summary

Describe what this PR does & how you reviewed the individual items, where needed:

Some helper checks to tick off:

  • Focus on image annotation
  • Focus on model training
  • Could any optimization be applied?
  • Is there any redundant code?
  • Are there any spelling errors?

In conclusion, after my review, I'd like to:

  • πŸ™‹ ask some clarifying questions
  • πŸ™… suggest some specific changes

@KristinaUlicna KristinaUlicna added the methodology Building functional & diverse pipeline label Oct 12, 2023
@KristinaUlicna KristinaUlicna self-assigned this Oct 12, 2023

@KristinaUlicna KristinaUlicna added this to the Merge `dev` -> `main` milestone Oct 12, 2023
@KristinaUlicna KristinaUlicna changed the base branch from main to development October 12, 2023 20:10
@KristinaUlicna KristinaUlicna marked this pull request as draft October 12, 2023 20:10
@KristinaUlicna KristinaUlicna marked this pull request as ready for review October 13, 2023 16:10

return target_list, subgraph_dataset
# Create a transform function with frozen arguments:
check_and_chop_partial = partial(
Collaborator
I find the name of this function a bit confusing; it is not very clear what is happening here. As with the main run.py script, maybe we should add more comments to explain what is happening?

store_permanently: bool = False,
extractor_fn: str | Path = None,
):
# Check if datasets are ready for training:
Collaborator

@crangelsmith crangelsmith Oct 16, 2023

As described for run.py, maybe add a docstring describing what these functions are about?

@crangelsmith
Collaborator

Question:

If I already have the graph attributes saved in my graph but want to update them, should I also set store_graph_attributes_permanently: True?

In that case some suggestions:

  • I think the config name should make it clearer that we are computing as well as storing these attributes.
  • I think we should log clearly what is happening with the attributes in the console. e.g:
    • If we load a graph that already has attributes and the config is False, log that we are using precomputed attributes.
    • If we load a graph that already has attributes and the config is True, log that we are recomputing the attributes and that the stored ones will be overwritten.
    • As with the missing-attributes case: the store_graph_attributes_permanently: False path is very well documented/described, but when it is True we should log it too.
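
The four logging cases suggested above could be sketched as follows. The logger name and the message wording are assumptions for illustration:

```python
import logging

logger = logging.getLogger("grace")  # logger name is an assumption

def log_attribute_plan(has_attributes: bool, store_permanently: bool) -> str:
    """Log which of the four attribute-handling cases applies."""
    if has_attributes and not store_permanently:
        msg = "Using precomputed graph attributes loaded from file."
    elif has_attributes and store_permanently:
        msg = "Recomputing graph attributes; stored values will be overwritten."
    elif store_permanently:
        msg = "Computing graph attributes & storing them permanently."
    else:
        msg = "Graph attributes missing & storing disabled; raising an error."
    logger.info(msg)
    return msg
```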

@KristinaUlicna KristinaUlicna marked this pull request as draft October 23, 2023 20:06
@KristinaUlicna
Collaborator Author

After some offline discussions & a re-consideration of the design, this PR lost its value, as the issues can be approached differently, without needing to store the NODE_FEATURES vector in the written-out graph. This is addressed in #329. Closing this deprecated PR now.

Labels
methodology Building functional & diverse pipeline
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[DEVELOPMENT] Allow feature / property extraction to be computed at run-time
2 participants