Analysis level metadata #30

rvosa · 2024-08-31T14:52:16Z

For downstream analysis of the performance of different pipelines and their parameters the uploads should include a YAML file with those parameters. The contents of the YAML file will then be attached to the result table. For example, YAML contents like this:

name: value

Would be joined with the result table such that there will be a column called name whose values will be value for all cells. The plan is that these would be factors (in the statistical sense) so that we can see whether different values for name have different results. To make this work, it is therefore key to decide on a small vocabulary for these terms and to look for overlap among the pipelines in some of their parameters. The simplest one would be pipeline: MGE versus pipeline: skim2mito. We can then merge the tables produced from the different pipelines and see, for example, whether MGE on average has more/fewer ambiguities than skim2mito.

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Analysis level metadata #30

Analysis level metadata #30

rvosa commented Aug 31, 2024

Analysis level metadata #30

Analysis level metadata #30

Comments

rvosa commented Aug 31, 2024