Data missing and provenance problems #8

Open
pmjklemm opened this issue Jul 11, 2023 · 0 comments

Hey Adrian, I found some problems with the webservice and I have some suggestions:

  1. Some projects listed in the public results tab (https://orthology.benchmarkservice.org/proxy/results/2020/)
    are missing from the public projects tab (https://orthology.benchmarkservice.org/proxy/projects/2020/),
    for example SonicParanoid2-sens, SonicParanoid2-... (only SonicParanoid is present).

  2. The tool names do not match exactly between the projects and results tabs (e.g. SonicParanoid_sensitive vs. sonicparanoid-sens). Maybe cross-link them or use short IDs, so there is no confusion.

  3. Version numbers are missing for almost all projects (e.g. Domainoid+ has no tool version assigned).
    • idea: you could make this a required field when submitting to the database

  4. Parameters are described inconsistently: some projects are well described and others are not.
    • idea: make the full command that generates the output a requirement for submitting data, so that others can reproduce the results
    • I tried to reproduce the SonicParanoid-sens results from the description but got very different results, probably because of a different version or a missing post-processing step...
    • RBH/BBH: which program generates these results? My RBH graph with 99% similarity and the specified e-value differs from the result in the benchmarks.

  5. It is clear to me that *-g represents a grouped upload, but maybe separate those results or make it clearer which uploads are paired and which are grouped.

  6. 'EnsemblCompara-e56' has multiple entries, and some of the data links go to your paper instead of the data uploaded for this result.
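
To make the two "idea" points above more concrete (tool version and full command as required fields), here is a minimal sketch of what such a check could look like at submission time. The field names `tool_name`, `tool_version` and `command` are assumptions for illustration only; I don't know the actual submission schema of the benchmark service.

```python
# Hypothetical field names; the real submission form may use different ones.
REQUIRED_FIELDS = ("tool_name", "tool_version", "command")


def missing_required_fields(metadata: dict) -> list:
    """Return the required fields that are absent or blank in a submission."""
    missing = []
    for field in REQUIRED_FIELDS:
        value = metadata.get(field)
        # Treat None, empty strings and whitespace-only strings as missing.
        if value is None or not str(value).strip():
            missing.append(field)
    return missing


# Made-up example submission, for illustration only.
example = {"tool_name": "ExampleTool", "tool_version": "", "command": None}
print(missing_required_fields(example))
```

A submission would only be accepted if the returned list is empty; otherwise the list could be shown to the submitter as the fields that still need to be filled in.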

Maybe this helps for the next QfO release!
