-
Notifications
You must be signed in to change notification settings - Fork 25
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Discovery Performance #284
Comments
I stumbled upon the same issue. Looks like the root cause is in |
This would probably require the SDK to move to SQLAlchemy 2.0 in order to use all the |
@florian-ernst-alan interesting. Is this the feature that would be relevant here?
|
I think it would yes. Basically, instead of getting all informations individually (for each table, get each column names, get each primary key...), get all of them at once and then use them downstream. Let's say I'm 90% sure. |
Overrides the SDK functions to instead use the `get_multi_*` functions from SQLAlchemy Inspector. On our database of ~120 tables, this reduces the discovery runtime from 10-12 minutes to about 30 seconds. - Closes #284
Original Issue in Slack Thread
tap-postgres==v0.0.2
Performance on discovery is taking about 1 second per table. This is extremely slow for large schemas (even 60 tables would take a minute)
Discovery on remote PSQL 11 server on high perfomance internet connection. Schema 3452 tables.
Please try and replicate the issue. Might still be something on our end
Proposed solutions
The text was updated successfully, but these errors were encountered: