The initial API for writing Delta Lake is a little bit clunky for the user.
When reading, users have to do something like this:
    from dask_deltatable import read_delta_table
    ddf = read_delta_table("path_to_table")
To write, they need this:
    from dask_deltatable.write import to_deltalake
    out = to_deltalake("path_to_table", ddf)
    out.compute()
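TODO:
- Naming is inconsistent: `read_delta_table` vs `to_deltalake`. Either of the following combos would be more consistent:
  - `read_delta_table` / `write_delta_table`
  - `read_deltalake` / `write_deltalake`
  - `read_delta_table` / `to_delta_table`
  - `read_deltalake` / `to_deltalake`
- `to_deltalake` should be exposed on top level, same as `read_delta_table`
- Users shouldn't need to call `compute` as an extra step; add a `compute: bool` kwarg instead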
read_foo / to_foo is the standard terminology in dask. I believe this is true for all IO APIs we're offering, see https://docs.dask.org/en/stable/dataframe-api.html#create-dataframes and https://docs.dask.org/en/stable/dataframe-api.html#store-dataframes
I suggest read_deltalake and to_deltalake
> to_deltalake should be exposed on top level, same as read_delta_table

+1

> user shouldn't need to call compute.

We typically offer a `compute` kwarg to control this behavior. I'm fine adding this to `to_deltalake` as well.
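
For illustration, here is a minimal sketch of how a `compute` kwarg usually behaves. It is not the dask-deltatable implementation: `LazyWrite`, the `store` dict (standing in for the file system), the list-of-rows input and the `compute=True` default are all assumptions made so the example runs with the standard library only.

```python
from typing import Callable, Optional


class LazyWrite:
    """Hypothetical stand-in for a lazy task graph (not a dask class)."""

    def __init__(self, action: Callable[[], None]) -> None:
        self._action = action

    def compute(self) -> None:
        # Run the deferred write.
        self._action()


def to_deltalake(
    path: str, rows: list, store: dict, compute: bool = True
) -> Optional[LazyWrite]:
    """Hypothetical writer; `store` stands in for the file system."""

    def _write() -> None:
        store[path] = list(rows)

    lazy = LazyWrite(_write)
    if compute:
        # Eager by default (an assumption): no extra step for the user.
        lazy.compute()
        return None
    # With compute=False the caller decides when the write runs.
    return lazy


store: dict = {}

# Default: the write happens immediately.
to_deltalake("path_to_table", [1, 2, 3], store)

# Opt-in laziness: nothing is written until .compute() is called.
pending = to_deltalake("other_table", [4, 5], store, compute=False)
written_before = "other_table" in store
pending.compute()
```

With this shape, the common case needs no explicit `.compute()` call, while `compute=False` still lets users build up a graph and trigger the write later.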