Skip to content

Commit

Permalink
Shifts to using IGC score instead of gentrain score in vcf2adpc.py fo…
Browse files Browse the repository at this point in the history
…r contamination checks.

Previously GC_SCORE was added to the adpc.bin which had depenency that a cluster egt file had to be used in preparation of vcf/bcf. IGC score is encoded in gtc so doesn't depended on cluster egt file. The conamination scores should also be more similar with the gtc input.
  • Loading branch information
rajwanir2 committed Sep 13, 2024
1 parent f4a0657 commit 270e105
Showing 1 changed file with 3 additions and 3 deletions.
6 changes: 3 additions & 3 deletions src/cgr_gwas_qc/workflow/scripts/vcf2adpc.py
Original file line number Diff line number Diff line change
Expand Up @@ -45,15 +45,15 @@ def main(
"""

vcf = VariantFile(bcf_file)
sample_fields_to_fetch = ["X", "Y", "NORMX", "NORMY", "GT"]
info_fieilds_to_fetch = ["ALLELE_A", "ALLELE_B", "GC_SCORE"]
sample_fields_to_fetch = ["X", "Y", "NORMX", "NORMY", "GT", "IGC"]
info_fieilds_to_fetch = ["ALLELE_A", "ALLELE_B"]

renaming_scheme = {
"X": "x_raw",
"Y": "y_raw",
"NORMX": "x_norm",
"NORMY": "y_norm",
"GC_SCORE": "genotype_score",
"IGC": "genotype_score",
}

vcf_info = []
Expand Down

0 comments on commit 270e105

Please sign in to comment.