Suggestions on the initial toolbox PR #5

jkgoodrich · 2024-12-11T05:49:09Z

No description provided.

- Add a description of the repo structure to the README
- Add some potential requirements
- Update the documentation and make sure it works
- Create a notebook specific to loading gnomAD release data and just showing what each dataset looks like.

- Addition of variant.py to store some functions that can also be used by frequency based filtering
- Changes to frequency filtering to use new default settings if not passed a ht
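
For readers unfamiliar with the "use default settings if not passed a ht" pattern, here is a minimal sketch of the idea. It is not the code in this PR: the names `set_default_data`, `load_default_table`, and `filter_by_frequency`, and the placeholder values `"exomes"` and `"4.1"`, are hypothetical, and a plain string stands in for a Hail Table so the example runs without Hail.

```python
# Session-wide defaults (hypothetical placeholder values).
_DEFAULTS = {"data_type": "exomes", "version": "4.1"}


def set_default_data(data_type, version):
    """Change the defaults used when no table is passed explicitly."""
    _DEFAULTS.update(data_type=data_type, version=version)


def load_default_table():
    """Stand-in loader: returns a label instead of a real Hail Table."""
    return f"{_DEFAULTS['data_type']}-{_DEFAULTS['version']}"


def filter_by_frequency(ht=None):
    """Use the caller's table if given, otherwise fall back to the defaults."""
    if ht is None:
        ht = load_default_table()
    # Real filtering logic would go here; the sketch just returns the table.
    return ht
```

With this shape, a notebook user sets the data type and version once and every filtering helper picks them up, while an explicitly passed table still takes precedence.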

review-notebook-app · 2024-12-11T05:49:14Z

Check out this pull request on ReviewNB to see visual diffs and provide feedback on Jupyter Notebooks.

KoalaQin

I'm sending these over for now because I need 1 of them for testing.

gnomad_toolbox/load_data.py

gnomad_toolbox/analysis/general.py

KoalaQin

Thank you for making this PR; your new code structure makes more sense!
I think we can merge yours after some changes, and I could work on the to-dos.

KoalaQin · 2024-12-13T22:06:59Z

README.md

+│   ├── intro_to_release_data.ipynb    # Jupyter notebook introducing the loading of gnomAD release data.
+```
+
+# TODO: Add fully detailed info about how to install and open the notebooks.


These steps should include:

- install Miniconda;
- set up a conda env for a specific version of Hail (and update Java);
- pip install gnomad_toolbox;
- set up the service account.

I can write this part once your PR is merged into mine.

Sounds good, but don't add anything yet. The first step is to get a bunch of tickets in for the TODOs, and then we can decide who will tackle what.

gnomad_toolbox/analysis/general.py

KoalaQin · 2024-12-14T00:05:54Z

gnomad_toolbox/filtering/variant.py

+        hl.utils.warning(
+            f"No variant found at {variant.locus} with alleles {variant.alleles}"
+        )


This warning is not working. Should we use the standard `warnings` module instead?
Could be:

    warnings.warn(
        f"No variant found at {variant.locus} with alleles {variant.alleles}",
        RuntimeWarning,
    )

Really? I got the warning.

Yeah, it's working, but it was printed after the init cell.
I didn't see it because I was far down the notebook.

It's just a Hail/notebook thing.
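
For reference, a minimal standalone sketch of the `warnings.warn` alternative raised in this thread. It does not use Hail: `find_variant` and the dict-based lookup are hypothetical stand-ins for the real filtering function, kept only to show where the warning is raised and how to check that it fired.

```python
import warnings


def find_variant(variants, locus, alleles):
    """Look up a variant in a dict keyed by (locus, alleles); warn if absent."""
    key = (locus, tuple(alleles))
    if key not in variants:
        # stacklevel=2 attributes the warning to the caller's line.
        warnings.warn(
            f"No variant found at {locus} with alleles {alleles}",
            RuntimeWarning,
            stacklevel=2,
        )
        return None
    return variants[key]


# Capture warnings so we can confirm one was emitted.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    result = find_variant({}, "chr1:100", ["A", "T"])
```

Because this goes through Python's own warning machinery rather than Hail's logger, the message should surface with the call that triggered it, which is the behaviour the original comment was looking for.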

KoalaQin · 2024-12-18T14:28:53Z

gnomad_toolbox/filtering/variant.py

+    #    )
+    # )
+
+    # TODO: Consider this alternative approach to get the intervals from gencode. That


I think your way is good; there is only one small problem. The GENCODE data the browser team used was preprocessed: I think they filtered out genes with the same ENSG ID that appear on both chrX and chrY (46 of them: 26 PAR, overlapping loci, etc.). We will get a different number of variants for those genes, e.g. https://gnomad.broadinstitute.org/gene/ENSG00000196433?dataset=gnomad_r4.
Do we want to add a prefilter function or just add a note?

OK, let's go with the faster way plus a filter, so we get the same numbers as the browser would. For now, let's just add a TODO and a ticket to do it.
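
As a starting point for that ticket, a rough sketch of how the affected genes could be identified. This is an assumption about the approach, not the browser team's actual preprocessing: `genes_on_both_x_and_y` is a hypothetical helper, the input is a plain list of (gene ID, contig) pairs rather than a Hail Table, and every gene ID except ENSG00000196433 is made up for illustration.

```python
from collections import defaultdict


def genes_on_both_x_and_y(records):
    """Return the gene IDs that are annotated on both chrX and chrY."""
    contigs_by_gene = defaultdict(set)
    for gene_id, contig in records:
        contigs_by_gene[gene_id].add(contig)
    return {
        gene_id
        for gene_id, contigs in contigs_by_gene.items()
        if {"chrX", "chrY"} <= contigs
    }


# Toy annotation records: (gene ID, contig).
records = [
    ("ENSG00000196433", "chrX"),
    ("ENSG00000196433", "chrY"),
    ("ENSG00000000001", "chr1"),  # hypothetical ID
    ("ENSG00000000002", "chrX"),  # hypothetical ID
]

xy_genes = genes_on_both_x_and_y(records)
```

What to do with the resulting set (drop the genes entirely, or keep only one copy) is left open here, since the thread does not settle how the browser handles them.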


KoalaQin

LGTM!

jkgoodrich added 4 commits December 9, 2024 10:49

- Restructure files

c183470

- Add a description of the repo structure to the README
- Add some potential requirements
- Update the documentation and make sure it works
- Create a notebook specific to loading gnomAD release data and just showing what each dataset looks like.

- Restructure files

095a234

- Add a description of the repo structure to the README
- Add some potential requirements
- Update the documentation and make sure it works
- Create a notebook specific to loading gnomAD release data and just showing what each dataset looks like.

- Modifications to support setting a default data_type and version

5fe3010

- Addition of variant.py to store some functions that can also be used by frequency based filtering
- Changes to frequency filtering to use new default settings if not passed a ht

More clean-up of notebooks and functions

5330ea6

jkgoodrich requested a review from KoalaQin December 11, 2024 05:49

jkgoodrich assigned jkgoodrich and KoalaQin Dec 11, 2024

jkgoodrich added the toolbox label Dec 11, 2024

Add notebooks to git

b853fc2

KoalaQin requested changes Dec 13, 2024


Fix unterminated string error

710bdcf

KoalaQin requested changes Dec 18, 2024


jkgoodrich and others added 2 commits December 18, 2024 09:01

Apply suggestions from code review

4bfc305

Co-authored-by: Qin He <44242118+KoalaQin@users.noreply.github.com>

format

ecb664b

jkgoodrich requested a review from KoalaQin December 18, 2024 16:09

KoalaQin approved these changes Dec 18, 2024


jkgoodrich merged commit d09953c into qh/draft_toolbox Dec 18, 2024
2 checks passed

jkgoodrich deleted the jg/draft_toolbox branch December 18, 2024 18:38
