Updates to the DMP metadata extractor Lambda for COKI #31

Open
briri opened this issue Dec 17, 2024 · 0 comments
briri commented Dec 17, 2024

  • Update the file size limit of the DmpExtractor Lambda so that a single file can be up to 100 MB (the limit is currently 10 MB)
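
A rough sketch of how a per-file size cap like this could be applied when splitting serialized records into output files. This is not the DmpExtractor's actual code: `MAX_FILE_BYTES`, `chunk_records`, and the choice of 1 MB = 1024 * 1024 bytes are all assumptions made for illustration.

```python
from typing import Iterable, Iterator, List

# Hypothetical constant: the proposed 100 MB cap (10 * 1024 * 1024 would be the current one).
# Treating 1 MB as 1024 * 1024 bytes is an assumption.
MAX_FILE_BYTES = 100 * 1024 * 1024


def chunk_records(lines: Iterable[bytes],
                  max_bytes: int = MAX_FILE_BYTES) -> Iterator[List[bytes]]:
    """Group serialized records into batches whose total size stays within max_bytes.

    A single record larger than max_bytes is still emitted, alone in its own batch.
    """
    batch: List[bytes] = []
    size = 0
    for line in lines:
        # Start a new batch when adding this record would exceed the cap.
        if batch and size + len(line) > max_bytes:
            yield batch
            batch, size = [], 0
        batch.append(line)
        size += len(line)
    if batch:
        yield batch
```

With a helper along these lines, raising the limit would only mean changing the one constant.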

Cleanup needed on the DmpIndexer:

  • Remove the grantId regexes. Grant IDs are not consistent enough for regexes to manage, so COKI will handle the cleanup on their end
  • Some of the list fields contain empty objects where they should contain strings or be an empty list: funders, funder_ids, grand_ids, see DMPs doi.org/10.48321/D1TG9N, doi.org/10.48321/D18C88. For example: "funders":[{}] "grand_ids":[{}]
  • The description field sometimes has a boolean value rather than a string or NULL, see DMP doi.org/10.48321/D114471AC3
  • The modified field sometimes has a boolean value rather than a date, see DMP doi.org/10.48321/D114471AC3
  • The project_end and project_start fields sometimes have boolean values rather than a date, see DMP doi.org/10.48321/D1MD2F
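
The field cleanup listed above could look something like the following. This is only a sketch, not the DmpIndexer's actual code: `clean_record` and the three field-name tuples are hypothetical, the field names (including the `grand_ids` spelling) are copied from this issue as written, and mapping boolean values to `None` is an assumed way of getting to NULL.

```python
from typing import Any, Dict

# Field names as written in this issue; the grouping is an assumption.
LIST_FIELDS = ("funders", "funder_ids", "grand_ids")
STRING_FIELDS = ("description",)
DATE_FIELDS = ("modified", "project_start", "project_end")


def clean_record(record: Dict[str, Any]) -> Dict[str, Any]:
    """Return a copy of the record with the malformed values from this issue normalized."""
    cleaned = dict(record)  # shallow copy so the input is not modified

    # Drop empty objects, None, and empty strings from list fields,
    # so that [{}] becomes [].
    for field in LIST_FIELDS:
        values = cleaned.get(field)
        if isinstance(values, list):
            cleaned[field] = [
                v for v in values
                if not (v is None or v == {} or v == "")
            ]

    # A boolean is never a valid description or date, so replace it with None.
    for field in STRING_FIELDS + DATE_FIELDS:
        if isinstance(cleaned.get(field), bool):
            cleaned[field] = None

    return cleaned
```

For example, `clean_record({"funders": [{}], "description": False})` gives `{"funders": [], "description": None}`.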