Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement solution to write mapped data to storage #51

Merged
merged 2 commits into from
Apr 8, 2024

Conversation

sb-2011
Copy link
Contributor

@sb-2011 sb-2011 commented Apr 3, 2024

  • Added functionality to write post mapped data to disk
  • Introduced landing zone for location to write this data
  • Added tests including a test for uploading mapped data to storage using minio

@sb-2011 sb-2011 requested a review from akuny April 3, 2024 20:30
@sb-2011 sb-2011 force-pushed the add-datawriter-class branch 2 times, most recently from 6a3efb2 to 79fd4bf Compare April 4, 2024 15:54
.gitignore Outdated
@@ -182,3 +182,6 @@ node_modules/

# Mac Desktop Services Store
*.DS_Store

# Landing zone data storage
# /data/landing_zone/*
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm not sure whether we're intending to keep data/landing_zone/ under version control given that this is commented out. Do we want to keep the directory and the README.md under VC and ignore everything else in the folder? If so I think we can accomplish that as such in the .gitignore file:

data/landing_zone/*
!data/landing_zone/README.md

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes good call, I'll make the changes.

@@ -68,11 +65,13 @@ def create_column_map_repository(self):
def create_logger(self):
return BasicLogger(__name__, logging.DEBUG)

def create_storage(self):
@staticmethod
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Does the create_storage method here and in the other ApplicationContext classes necessarily need to be static? Just curious.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I made it static because I need to use a storage class instance in the task_queue.py file and I don't want to create an application context just for that use case. I wasn't able to pass the application context to the celery task like we discussed before due to serialization so this was a work around.

Copy link
Contributor

@akuny akuny left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good, just a few comments. Also the CI job is failing FWIW.

@sb-2011 sb-2011 force-pushed the add-datawriter-class branch from 79fd4bf to 45a7e56 Compare April 5, 2024 19:58
@sb-2011 sb-2011 merged commit 1fad9de into main Apr 8, 2024
1 check passed
@sb-2011 sb-2011 deleted the add-datawriter-class branch April 8, 2024 16:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants