feat: Firestore stag db config #6

saiyam3243 · 2023-11-27T14:20:29Z

Motivation

For the integration of firestore db with this repo

Changes

Integration of firestore in repo and passed env variables via ci.yml

Checklist

added myself as assignee
correct reviewers
descriptive PR title using conventional commits.
description explains the motivation and details of the changes
tests cover my changes
documentation is updated
CI is green
breaking changes are discussed with the team and documented in the PR title ! (e.g. feat!: Update endpoint)

linear · 2023-11-27T14:20:33Z

META-36 Connect Firestore database to Sourcing Repo(s)

Just figured that one of your tickets was a duplicate, therefore adding a new one as otherwise your workload wouldn't match ours this week :)

Basically the same task of connecting firebase as for the analytics repo, but for at least 1 sourcing repo! Mostly reusing the same logic as in the analytics ticket.

github-actions · 2023-11-27T14:20:53Z

☂️ Python Coverage

current status: ✅

Overall Coverage

Lines	Covered	Coverage	Threshold	Status
163	55	34%	0%	🟢

New Files

File	Coverage	Status
parma_db/firestore_service.py	0%	🟢
parma_mining/parma_db/init.py	100%	🟢
parma_mining/parma_db/firestore_service.py	0%	🟢
TOTAL	33%	🟢

Modified Files

No covered modified files...

updated for commit: d1aca00 by action🐍

egekocabas · 2023-11-27T14:30:38Z

This is the same as here right?
Do you know how to test this db connection?

.github/workflows/ci.yml

saiyam3243 · 2023-11-27T16:18:41Z

This is the same as here right? Do you know how to test this db connection?

yes the task is similar. We need the db connection in both repos.

Constantin343 · 2023-11-27T18:29:54Z

Does this give "selective" access to the database? I.e., only the part that is relevant for the Github module, so we can't interfere with other modules writing to the db?

Also, I would need the secrets to test.

saiyam3243 · 2023-11-27T18:46:14Z

Does this give "selective" access to the database? I.e., only the part that is relevant for the Github module, so we can't interfere with other modules writing to the db?

Also, I would need the secrets to test.

Will update you after team meet and secrets are sent.

robinholzi · 2023-11-27T23:01:03Z

Does this give "selective" access to the database? I.e., only the part that is relevant for the Github module, so we can't interfere with other modules writing to the db?

Also, I would need the secrets to test.

Yes, that would also be a requirement before merging this!

robinholzi

Same comments as here

egekocabas · 2023-11-28T00:47:17Z

parma_db/firestore_db.py

+import firebase_admin
+import os
+from firebase_admin import credentials
+from firebase_admin import firestore


Looks like these imports are little bit mixed.

Also I couldn't find a package for firebase_admin and firebase-admin using micromamba. But after activating the micromamba environment, I was able to use pip command which is for the activated environment

(parma-mining-github) egekocabas@Eges-Air parma-mining-github % which pip /Users/egekocabas/micromamba/envs/parma-mining-github/bin/pip (parma-mining-github) egekocabas@Eges-Air parma-mining-github %

GitHub firebase-admin here
Firebase setup here

It seems that we need to use pip to install this dependency. If its the case then it should be added into the README.md

This caught on my eye.

I think in here we should add below line into Makefile --> install

pip install firebase-admin

then I worked for me 👌🏻

this pip install needs to be added to Makefile and also in Dockerfile

parma_db/firestore_db.py

robinholzi · 2023-11-30T14:14:56Z

@saiyam3243 How's the status here? As this is still a task from last week I'd like to see this finished as soon as possible! As It's only ~80 lines of code , I think this shouldn't be too hard to finish in time.

Please let us know if you can foreseeably not finish all the tasks assigned to you so we can reallocate our resources accordingly!

Constantin343 · 2023-12-02T15:51:50Z

@saiyam3243 @robinholzi Would be great if we can merge this soon. Not being able to use the db is starting to be a blocker for us.

robinholzi · 2023-12-02T16:15:25Z

@saiyam3243 @robinholzi Would be great if we can merge this soon. Not being able to use the db is starting to be a blocker for us.

@Constantin343 I see that this topic is quite important for you, that's why we already started it in a sprint of Nov. 20th.
As the respective PRs still seem to be far from ready I think I might need to take this over as @saiyam3243 is not available this weekend afaik (or maybe that changed with the weather conditions?)

I hope that you/your team (@Constantin343) and @saiyam3243 have finished the schema specifications by now. (meta linear task)

saiyam3243 · 2023-12-02T16:23:14Z

@saiyam3243 @robinholzi Would be great if we can merge this soon. Not being able to use the db is starting to be a blocker for us.

Hi @Constantin343, I am waiting for your response on slack since two days on the topic of firestore db schema, my implementation is ready but I am just waiting if you need any changes there, if you think my proposed schema is fine, I can push my changes. I have already pushed my code in analytics team and the code will look similar but with few changes.

Constantin343 · 2023-12-02T16:54:49Z

@saiyam3243 @robinholzi Would be great if we can merge this soon. Not being able to use the db is starting to be a blocker for us.

Hi @Constantin343, I am waiting for your response on slack since two days on the topic of firestore db schema, my implementation is ready but I am just waiting if you need any changes there, if you think my proposed schema is fine, I can push my changes. I have already pushed my code in analytics team and the code will look similar but with few changes.

@saiyam3243 sorry, somehow I didn't get notified by Slack and therefore did not give you feedback on the schema. Do you need something else from data sourcing to proceed?

robinholzi · 2023-12-02T19:18:16Z

@Constantin343 @saiyam3243 Ping me on Slack once you resolve the unclear points!

Constantin343 · 2023-12-03T12:02:27Z

Does this give "selective" access to the database? I.e., only the part that is relevant for the Github module, so we can't interfere with other modules writing to the db?

Also, I would need the secrets to test.

@saiyam3243 How is the access handeled now? I saw that I can define the datasource as a string to write to a collection (e.g., function add_new_raw_data), which looks like we can access all collections from every data source or is there already some mechanism in place to restrict the access?

egekocabas

I left some comments 👌🏻 but before starting, it is better to update this branch first (merge from main to your branch)

.github/workflows/ci.yml

egekocabas · 2023-12-03T12:55:31Z

parma_db/firestore_db.py

+import firebase_admin
+import os
+from firebase_admin import credentials
+from firebase_admin import firestore


this pip install needs to be added to Makefile and also in Dockerfile

parma_db/firestore_db.py

parma_db/firestore_service.py

Constantin343 · 2023-12-03T15:14:37Z

parma_db/firestore_service.py

+                .collection(page_id)
+                .document("raw_data")
+            )
+            doc_ref.set(raw_data_content)


So that would mean we overwrite the old raw_data or how would it work? I think we need a way to ensure we don't do that, maybe we can add a new document with the timestamp every time we write raw data.
It is important for the client that we collect the data over time

Yes, as I mentioned in the last comment, we can add time stamp and last _modified_page fields to the pages document to avoid raw_data to get overwritten!

But why is page_id a string here and not a dict as before? That seems to be a bit inconsistent

thats a mistake. changed now!

Constantin343 · 2023-12-04T10:15:20Z

Does this give "selective" access to the database? I.e., only the part that is relevant for the Github module, so we can't interfere with other modules writing to the db?
Also, I would need the secrets to test.

@saiyam3243 How is the access handeled now? I saw that I can define the datasource as a string to write to a collection (e.g., function add_new_raw_data), which looks like we can access all collections from every data source or is there already some mechanism in place to restrict the access?

Any updates regarding this topic?

…o-sourcing-repos

saiyam3243 · 2023-12-04T12:04:42Z

Does this give "selective" access to the database? I.e., only the part that is relevant for the Github module, so we can't interfere with other modules writing to the db?
Also, I would need the secrets to test.

@saiyam3243 How is the access handeled now? I saw that I can define the datasource as a string to write to a collection (e.g., function add_new_raw_data), which looks like we can access all collections from every data source or is there already some mechanism in place to restrict the access?

Any updates regarding this topic?

When using the Firebase Admin SDK, security rules do not apply because it assumes you have administrative privileges. To restrict access on the server side, we need to implement our logic like defining a list of authenticated users for a particular data source and checking their access every time!

saiyam3243 · 2023-12-04T18:20:14Z

Does this give "selective" access to the database? I.e., only the part that is relevant for the Github module, so we can't interfere with other modules writing to the db?
Also, I would need the secrets to test.

@saiyam3243 How is the access handeled now? I saw that I can define the datasource as a string to write to a collection (e.g., function add_new_raw_data), which looks like we can access all collections from every data source or is there already some mechanism in place to restrict the access?

Any updates regarding this topic?

When using the Firebase Admin SDK, security rules do not apply because it assumes you have administrative privileges. To restrict access on the server side, we need to implement our logic like defining a list of authenticated users for a particular data source and checking their access every time!

I found out a way. Within firestore db rules, we can specify the rights of the user/devs, in that way we can save db to get accessed by anyone

robinholzi · 2023-12-05T17:23:51Z

Does this give "selective" access to the database? I.e., only the part that is relevant for the Github module, so we can't interfere with other modules writing to the db?
Also, I would need the secrets to test.

@saiyam3243 How is the access handeled now? I saw that I can define the datasource as a string to write to a collection (e.g., function add_new_raw_data), which looks like we can access all collections from every data source or is there already some mechanism in place to restrict the access?

Any updates regarding this topic?

When using the Firebase Admin SDK, security rules do not apply because it assumes you have administrative privileges. To restrict access on the server side, we need to implement our logic like defining a list of authenticated users for a particular data source and checking their access every time!

I found out a way. Within firestore db rules, we can specify the rights of the user/devs, in that way we can save db to get accessed by anyone

fyi. For that, we need also need to authenticate first and have "service accounts" within the firebase auth service. I will build something for that as currently there's nothing in place as far as I can see.

robinholzi · 2023-12-05T19:31:05Z

This solution uses the firebase_admin SDK which is NOT appropriate for usage in sourcing modules as it grants root like access to Firestore. This access cannot be restricted (through e.g. Firestore rules). Therefore we cannot rely on direct Firebase access here. Therefore I'll close this PR.

feat: Firestore stag db config

b6ad9a9

saiyam3243 requested review from egekocabas, Nomiez and Constantin343 November 27, 2023 14:20

saiyam3243 self-assigned this Nov 27, 2023

saiyam3243 requested a review from robinholzi as a code owner November 27, 2023 14:20

github-actions bot added the enhancement New feature or request label Nov 27, 2023

egekocabas reviewed Nov 27, 2023

View reviewed changes

.github/workflows/ci.yml Outdated Show resolved Hide resolved

egekocabas mentioned this pull request Nov 27, 2023

build: Add Continuous Deployment & terraform #5

Merged

8 tasks

saiyam3243 closed this Nov 27, 2023

saiyam3243 reopened this Nov 27, 2023

robinholzi requested changes Nov 27, 2023

View reviewed changes

egekocabas requested changes Nov 28, 2023

View reviewed changes

Saiyam Jain added 2 commits December 3, 2023 00:28

feat: Firestore stag config

77b9d7d

feat: Firestore stag config

9285f6a

egekocabas requested changes Dec 3, 2023

View reviewed changes

Constantin343 reviewed Dec 3, 2023

View reviewed changes

parma_db/firestore_service.py Show resolved Hide resolved

Constantin343 reviewed Dec 3, 2023

View reviewed changes

Saiyam Jain added 3 commits December 4, 2023 09:58

feat: Remove unwanted code

9a2f64f

feat: Changed file structure

10cfc21

feat: Firestore config code changes

308dab3

Merge branch 'main' into feature/meta-36-connect-firestore-database-t…

ba67e11

…o-sourcing-repos

feat: Firestore new methods

7239842

feat: Firestore few changes

d1aca00

robinholzi marked this pull request as draft December 5, 2023 17:28

robinholzi closed this Dec 5, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Firestore stag db config #6

feat: Firestore stag db config #6

saiyam3243 commented Nov 27, 2023 •

edited

Loading

linear bot commented Nov 27, 2023

github-actions bot commented Nov 27, 2023 •

edited

Loading

egekocabas commented Nov 27, 2023

saiyam3243 commented Nov 27, 2023

Constantin343 commented Nov 27, 2023

saiyam3243 commented Nov 27, 2023

robinholzi commented Nov 27, 2023

robinholzi left a comment

egekocabas Nov 28, 2023

egekocabas Nov 30, 2023

egekocabas Dec 3, 2023

robinholzi commented Nov 30, 2023

Constantin343 commented Dec 2, 2023

robinholzi commented Dec 2, 2023

saiyam3243 commented Dec 2, 2023

Constantin343 commented Dec 2, 2023

robinholzi commented Dec 2, 2023

Constantin343 commented Dec 3, 2023

egekocabas left a comment •

edited

Loading

egekocabas Dec 3, 2023

Constantin343 Dec 3, 2023

saiyam3243 Dec 4, 2023

Constantin343 Dec 4, 2023

saiyam3243 Dec 4, 2023

Constantin343 commented Dec 4, 2023

saiyam3243 commented Dec 4, 2023

saiyam3243 commented Dec 4, 2023

robinholzi commented Dec 5, 2023

robinholzi commented Dec 5, 2023

feat: Firestore stag db config #6

feat: Firestore stag db config #6

Conversation

saiyam3243 commented Nov 27, 2023 • edited Loading

Motivation

Changes

Checklist

linear bot commented Nov 27, 2023

github-actions bot commented Nov 27, 2023 • edited Loading

☂️ Python Coverage

Overall Coverage

New Files

Modified Files

egekocabas commented Nov 27, 2023

saiyam3243 commented Nov 27, 2023

Constantin343 commented Nov 27, 2023

saiyam3243 commented Nov 27, 2023

robinholzi commented Nov 27, 2023

robinholzi left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

robinholzi commented Nov 30, 2023

Constantin343 commented Dec 2, 2023

robinholzi commented Dec 2, 2023

saiyam3243 commented Dec 2, 2023

Constantin343 commented Dec 2, 2023

robinholzi commented Dec 2, 2023

Constantin343 commented Dec 3, 2023

egekocabas left a comment • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Constantin343 commented Dec 4, 2023

saiyam3243 commented Dec 4, 2023

saiyam3243 commented Dec 4, 2023

robinholzi commented Dec 5, 2023

robinholzi commented Dec 5, 2023

saiyam3243 commented Nov 27, 2023 •

edited

Loading

github-actions bot commented Nov 27, 2023 •

edited

Loading

egekocabas left a comment •

edited

Loading