Transfer Processed data to Delta Lake Repository #27

Open
Tracked by #25
j-knudson opened this issue Sep 15, 2023 · 0 comments

j-knudson (Collaborator) commented Sep 15, 2023

Parent Feature: #25
Prerequisite Ticket: #26

Technical Details

Brainwave data donated for training machine learning algorithms must first be processed with the file_shuffler app for privacy. The processed data then needs to be stored in a Delta Lake repository. There are two versions of the file shuffling script, one for Linux (https://github.com/3C-SCSU/Avatar/blob/main/server/file-shuffler_linux.py) and one for Windows (https://github.com/3C-SCSU/Avatar/blob/main/server/file-shuffler_win.py).
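Because the shuffler ships as two platform-specific scripts, any automated transfer step has to pick the right one first. A minimal sketch of that choice follows; the script file names come from the links above, while the `server/` location relative to the working directory and the helper name `shuffler_script_for` are assumptions made for illustration.

```python
import platform

# File names are taken from this ticket; the relative "server/" location
# is an assumption about where the repository is checked out.
SHUFFLER_SCRIPTS = {
    "Linux": "server/file-shuffler_linux.py",
    "Windows": "server/file-shuffler_win.py",
}


def shuffler_script_for(system_name=None):
    """Return the shuffler script path for the given OS name.

    When system_name is omitted, the current platform is used.
    """
    name = system_name or platform.system()
    try:
        return SHUFFLER_SCRIPTS[name]
    except KeyError:
        # Fail loudly instead of guessing on unsupported platforms.
        raise RuntimeError(f"No file shuffler script for platform {name!r}")
```

On the VPS, `shuffler_script_for()` would resolve to the Linux script, assuming the VPS runs Linux.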

The data is currently stored directly on the VPS at the path: /home/Avatar/Deployment/data

Either

  1. create a connection from this directory to the Delta Lake repository, or
  2. create a direct transfer of processed data to the Delta Lake repository.

Of the two, the second option is preferred and is the long-term goal. If necessary, it can be implemented in a future ticket.
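As a starting point for the second option, the sketch below finds processed files under the data directory and appends them to a Delta table. It is not the project's implementation. It assumes the processed files are CSV files with a consistent schema, that the `pandas` and `deltalake` packages are installed, and that `table_uri` points at a location the VPS can write to. The function names are hypothetical.

```python
from pathlib import Path

# Path taken from this ticket.
DATA_DIR = Path("/home/Avatar/Deployment/data")


def collect_processed_files(data_dir, suffix=".csv"):
    """Return processed files under data_dir, sorted for a stable order.

    The ".csv" suffix is an assumption about the processed file format.
    """
    return sorted(p for p in Path(data_dir).rglob(f"*{suffix}") if p.is_file())


def transfer_to_delta(files, table_uri):
    """Append each file's rows to the Delta table at table_uri.

    Returns the total number of rows written.
    """
    # Imported lazily so file discovery works without these packages.
    import pandas as pd
    from deltalake import write_deltalake

    rows = 0
    for path in files:
        frame = pd.read_csv(path)
        # mode="append" adds rows to the table instead of replacing it.
        write_deltalake(table_uri, frame, mode="append")
        rows += len(frame)
    return rows
```

A call such as `transfer_to_delta(collect_processed_files(DATA_DIR), table_uri)` would then cover the direct transfer, with `table_uri` supplied by whoever provisions the repository.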

Acceptance Criteria

  • Processed data can be transferred to a Delta Lake Repository.
  • When designing the transfer method, keep in mind the long-term goal of running the transfer from the GUI.
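One way to keep the GUI goal open is to write the transfer as a plain function that reports its outcome as data instead of printing or exiting, so a command-line script and a GUI button can both call it. The sketch below illustrates that shape only. `TransferReport`, `run_transfer`, and the `send` callable are hypothetical names, and `send` stands in for whatever function actually writes one file to the Delta Lake repository.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Iterable, List, Optional


@dataclass
class TransferReport:
    """Outcome of one transfer run, suitable for display in a GUI."""
    transferred: List[str] = field(default_factory=list)
    failed: Dict[str, str] = field(default_factory=dict)

    @property
    def ok(self) -> bool:
        return not self.failed


def run_transfer(
    files: Iterable[str],
    send: Callable[[str], None],
    on_progress: Optional[Callable[[int, int], None]] = None,
) -> TransferReport:
    """Send each file with `send`, collecting failures instead of raising."""
    report = TransferReport()
    files = list(files)
    for index, path in enumerate(files, start=1):
        try:
            send(path)
            report.transferred.append(path)
        except Exception as exc:
            # Record the error so the caller can show it to the user.
            report.failed[path] = str(exc)
        if on_progress is not None:
            # A GUI could use this callback to update a progress bar.
            on_progress(index, len(files))
    return report
```

A GUI would pass its own `on_progress` callback and show `report.failed` to the user, while a script could simply check `report.ok`.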
j-knudson added this to Avatar Sep 15, 2023
j-knudson converted this from a draft issue Sep 15, 2023
j-knudson added this to the DevOps Improvements milestone Sep 15, 2023
j-knudson moved this from 🔖 Ready to 📋 Backlog in Avatar Sep 15, 2023
anagorlov added a commit to anagorlov/Avatarpro that referenced this issue Dec 9, 2024
3C-SCSU added a commit that referenced this issue Dec 18, 2024
Added Ticket #27 documentation for Delta Lake data transfer
Projects
Status: 📋 Backlog