-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Workflow orchestrator for scalable/reliable ingestion #207
Comments
@vipinshreyaskumar please create a separate folder in root-repo as of now. Sub folder can cause issues for ci/cd. Root folder could be called unoplat-code-confluence-harvestor if you feel that's a cool name. Second when you start committing please follow https://www.conventionalcommits.org/en/v1.0.0/ as our ci/cd depends on it and it will cause issues if commit messages do not follow the convention. So I am working on total rewamp for context performance reliability at algorithm level in #206 issue once that's completed we could merge harvestor code and current utility. |
|
Cloning a GitHub repo and running certain operations through linting tools on the codebase would be the starting point. @vipinshreyaskumar . Our config looks like this:
Right now we use:
then post doing this we have to run isort with below config
This will make prerequisites for our parsing. |
i have pushed description/diagram in readme which is unoplat-code-confluence in this branch for our milestone 2 @vipinshreyaskumar @apekshamehta @milind12 . i will do the same in our google docs tomorrow and kickoff. |
we now have a skeleton with one actual activity being run through fastapi implemented in temporal. We also have improved our contributor experience through task file - where one just has to run - task dev. to setup venv ,install packages and start fastapi server. @vipinshreyaskumar . One needs to install uv, task and fastapi cli that's all. |
This branch will have pushes everyday please do pull to checkout. This also has improved cli experience which accepts configuration and manages n requests per n repos to our fastapi based code confluence flow bridge. |
We have successfully incorporated two tasks via temporal:
Code is pushed. cc: @vipinshreyaskumar will update next plan for activities and in design doc as well. |
Tasks:
|
We have fixed launching parallel child workflows. For parent-child workflows child workflows will be long running and we will be using across parent/child workflows database references to merge/relate data so we do not require parent to be waiting for it. so we have set parent child policy to be abandon. This will ensure child workflows keep running independent of parent. |
Workflow orchestration has to be enabled for Scalable and reliable ingestion.
The text was updated successfully, but these errors were encountered: