Sfitz add mosdepth quantize #88

sorelfitzgibbon · 2024-10-30T01:43:08Z

Description

Add mosdepth quantize and update nftest.
Currently written for 4 quantize bins, but the original software allows any number of bins. An issue has been made.

Testing Results

NFTest
- log: /hot/software/pipeline/pipeline-generate-SQC-BAM/Nextflow/development/unreleased/sfitz-add-mosdepth-quantize/log-nftest-20241101T231717Z.log
- cases: default set

Checklist

I have read the code review guidelines and the code review best practice on GitHub check-list.
I have reviewed the Nextflow pipeline standards.
The name of the branch is meaningful and well formatted following the standards, using [AD_username (or 5 letters of AD if AD is too long)]-[brief_description_of_branch].
I have set up or verified the branch protection rule following the github standards before opening this pull request.
I have added my name to the contributors listings in the manifest block in the nextflow.config as part of this pull request, am listed
already, or do not wish to be listed. (This acknowledgement is optional.)
I have added the changes included in this pull request to the CHANGELOG.md under the next release version or unreleased, and updated the date.
I have updated the version number in the metadata.yaml and manifest block of the nextflow.config file following semver, or the version number has already been updated. (Leave it unchecked if you are unsure about new version number and discuss it with the infrastructure team in this PR.)
I have tested the pipeline using NFTest, or I have justified why I did not need to run NFTest above.

…M into merge-test

yashpatel6

Generally looking good! A few minor details to iron out:

yashpatel6 · 2024-11-01T06:12:35Z

config/schema.yaml

+mosdepth_quantize_use_fast_algorithm:
+  type: 'Bool'
+  required: false
+  default: false
+  help: 'Use fast algorithm for quantizing coverage values'


question (non-blocking): Is there a downside to enabling the use of the fast algorithm by default?

Yeah, this is a difficult decision. Not using fast mode gives more "correct" results. Fast mode ignores paired read overlaps and CIGAR strings (thus indels wrt ref). Ignoring the paired read overlap is the bigger issue, especially for samples with small insert sizes (wrt read length). This is noted in the README. The time difference isn't clear as I haven't benchmarked more than a few samples and not directly on scratch. It's enough that the mosdepth author recommends fast mode for most use-cases (I assume non-small insert cases). We currently have fast mode true by default for the regular mosdepth coverage calculation, so we should probably change one or the other to make them consistent.

yashpatel6 · 2024-11-01T06:14:58Z

test/config/mosdepth-coverage.config

+process {
+    withName: run_validate_PipeVal {
+        when = false
+    }
+}


suggestion: This can be removed, the test file should be small enough that the validation shouldn't pose a problem

yashpatel6 · 2024-11-01T06:17:16Z

main.nf

+include { assess_coverage_mosdepth } from './module/windows_mosdepth' addParams(
+    workflow_output_dir: "${params.output_dir_base}/mosdepth-${params.mosdepth_version}",
+    workflow_log_output_dir: "${params.log_output_dir}/process-log/mosdepth-${params.mosdepth_version}"
+    )


suggestion: This import is duplicated below; one of them can be removed

yashpatel6 · 2024-11-01T06:19:20Z

module/quantize_mosdepth.nf

+    export MOSDEPTH_Q0=${params.mosdepth_q0_label}
+    export MOSDEPTH_Q1=${params.mosdepth_q1_label}
+    export MOSDEPTH_Q2=${params.mosdepth_q2_label}
+    export MOSDEPTH_Q3=${params.mosdepth_q3_label}


suggestion: Quoting since the parameters are used directly and to make it a bit safer

Suggested change

export MOSDEPTH_Q0=${params.mosdepth_q0_label}

export MOSDEPTH_Q1=${params.mosdepth_q1_label}

export MOSDEPTH_Q2=${params.mosdepth_q2_label}

export MOSDEPTH_Q3=${params.mosdepth_q3_label}

export MOSDEPTH_Q0="${params.mosdepth_q0_label}"

export MOSDEPTH_Q1="${params.mosdepth_q1_label}"

export MOSDEPTH_Q2="${params.mosdepth_q2_label}"

export MOSDEPTH_Q3="${params.mosdepth_q3_label}"

yashpatel6 · 2024-11-01T06:21:07Z

module/quantize_mosdepth.nf

+        path ".command.*"
+
+    script:
+    output_filename = generate_standard_filename("mosdepth${params.picard_version}",


Suggested change

output_filename = generate_standard_filename("mosdepth${params.picard_version}",

output_filename = generate_standard_filename("mosdepth-${params.picard_version}",

sorelfitzgibbon · 2024-11-02T22:02:13Z

Test path within initial description has been updated for new test results.

sorelfitzgibbon added 30 commits June 27, 2024 10:00

add mosdepth and index file and change path name

1e093a2

update resources

5375b19

update version in metadata

7ff81eb

fix path to bam in tuple

9a971e5

update changelog

fcd824b

resource adjustments

91136c9

adjust log info

5e345c4

change process names

8beaf43

rename bamqc_outformat to bamqc_output_format

acbb0fd

rename bamqc_outformat to bamqc_output_format

668d83c

remove fastqc as default

b178e75

fix CollectWgsMetrics params bug

b59503d

add mosdepth and index file and change path name

c3d525f

update changelog

b97d512

Merge branch 'main' of github.com:uclahs-cds/pipeline-generate-SQC-BA…

7d536cd

…M into merge-test

fix left over merge lines M64.config

f90c11f

change template mosdepth fast to false

35f5aac

add quantize to resource configs

4975b4c

add quantize to resource configs

cad71c7

update schema

2f89689

add quantize

80b352e

add quantize

76fcb40

Merge branch 'sfitz-add-mosdepth' into sfitz-add-mosdepth-quantize

8a254cc

add mosdepth to nftest

1aca7bc

update nftest mosdepth slow

8ea75a1

change algorithm option coverage to windows

81ee369

update README

7c0812b

merge in coverage to windows

82bdfe9

require quantize cutoffs

1715272

output filename dash and add to test config

51fdb1e

sorelfitzgibbon added 11 commits July 29, 2024 12:56

update changelog

3f11738

add mosdepth per-base output

313b079

update submodules

ce597f6

merge main

55679d0

move gitignore lines to local only

8c976ab

add nftest config files

8aa3634

finish main merge

7462538

alorithm name

cab90a8

add quantize to nftest and update config names

4fe2727

typo

027bad9

update readme

7e48841

sorelfitzgibbon requested a review from a team as a code owner October 30, 2024 01:43

sorelfitzgibbon added 3 commits October 29, 2024 19:35

fix duplicate keys

9527f7f

fix nftest

563775d

reorganize nftest.yml

5fb7a72

yashpatel6 reviewed Nov 1, 2024

View reviewed changes

sorelfitzgibbon added 3 commits November 1, 2024 15:36

fix minor issues

b5be444

fix output filenames, mosdepth windows too

4514bd4

Merge remote-tracking branch 'origin' into sfitz-add-mosdepth-quantize

20a6d52

sorelfitzgibbon requested a review from yashpatel6 November 2, 2024 22:02

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sfitz add mosdepth quantize #88

Sfitz add mosdepth quantize #88

sorelfitzgibbon commented Oct 30, 2024 •

edited

Loading

yashpatel6 left a comment

yashpatel6 Nov 1, 2024

sorelfitzgibbon Nov 1, 2024 •

edited

Loading

yashpatel6 Nov 1, 2024

yashpatel6 Nov 1, 2024

yashpatel6 Nov 1, 2024

yashpatel6 Nov 1, 2024

sorelfitzgibbon commented Nov 2, 2024

	output_filename = generate_standard_filename("mosdepth${params.picard_version}",
	output_filename = generate_standard_filename("mosdepth-${params.picard_version}",

Sfitz add mosdepth quantize #88

Are you sure you want to change the base?

Sfitz add mosdepth quantize #88

Conversation

sorelfitzgibbon commented Oct 30, 2024 • edited Loading

Description

Testing Results

Checklist

yashpatel6 left a comment

Choose a reason for hiding this comment

yashpatel6 Nov 1, 2024

Choose a reason for hiding this comment

sorelfitzgibbon Nov 1, 2024 • edited Loading

Choose a reason for hiding this comment

yashpatel6 Nov 1, 2024

Choose a reason for hiding this comment

yashpatel6 Nov 1, 2024

Choose a reason for hiding this comment

yashpatel6 Nov 1, 2024

Choose a reason for hiding this comment

yashpatel6 Nov 1, 2024

Choose a reason for hiding this comment

sorelfitzgibbon commented Nov 2, 2024

sorelfitzgibbon commented Oct 30, 2024 •

edited

Loading

sorelfitzgibbon Nov 1, 2024 •

edited

Loading