depr(python,rust!): Rename parameter `by` to `group_by` in `DataFrame.upsample/group_by_dynamic/rolling` #14840

MarcoGorelli · 2024-03-04T15:08:49Z

This hasn't been fully discussed, but it's related to #10989 and it was pretty easy to F2-replace these. Sometimes, just opening a PR is the fastest way to make a decision, so... here this is 😄

In summary: `by` in Polars usually refers to the columns the operation itself is performed by (e.g. `sort` sorts by the `by` columns), but sometimes to the columns you group by before applying the operation. For example:

- `df.upsample('a', '3d', by='b')` means "group by 'b' and then upsample by column 'a' every 3 days"
- `df.top_k(3, by='b')` means "take the 3 rows where column 'b' is largest"

If an operation lets you group by certain columns before applying the operation, then I think it would be clearer to use group_by for that parameter.
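Since the title marks this as a deprecation on the Python side, the old keyword would presumably keep working for a while and emit a warning. The sketch below shows one generic way to do that in plain Python. The `renamed_parameter` decorator and the stand-in `upsample` function are hypothetical and only illustrate the idea; this is not the Polars implementation.

```python
import functools
import warnings


def renamed_parameter(old_name, new_name):
    """Hypothetical decorator: accept `old_name` as a deprecated alias of `new_name`."""

    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            if old_name in kwargs:
                if new_name in kwargs:
                    raise TypeError(
                        f"pass only `{new_name}`, not both `{old_name}` and `{new_name}`"
                    )
                warnings.warn(
                    f"`{old_name}` is deprecated, use `{new_name}` instead",
                    DeprecationWarning,
                    stacklevel=2,
                )
                # Forward the value under the new keyword name.
                kwargs[new_name] = kwargs.pop(old_name)
            return func(*args, **kwargs)

        return wrapper

    return decorator


@renamed_parameter("by", "group_by")
def upsample(time_column, every, *, group_by=None):
    # Stand-in for a real method: just echo back the resolved arguments.
    return {"time_column": time_column, "every": every, "group_by": group_by}


new_style = upsample("a", "3d", group_by="b")

# The old keyword still works, but a DeprecationWarning is recorded.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    old_style = upsample("a", "3d", by="b")
```

Both calls resolve to the same arguments; only the old spelling triggers the warning.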

This would also open the door to `df.top_k(3, by='b', group_by='a')`, since the `by` in `top_k` is already taken.
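To illustrate why the two meanings need separate parameter names, here is a plain-Python sketch over a list of dicts. The `top_k` helper and the sample rows are made up for illustration and are not Polars' API; `by` picks the column to rank on, while `group_by` picks the column to partition on first.

```python
from collections import defaultdict


def top_k(rows, k, *, by, group_by=None):
    """Return the k rows with the largest `by` value, optionally per group."""
    if group_by is None:
        return sorted(rows, key=lambda r: r[by], reverse=True)[:k]
    # Partition the rows first, then take the top k inside each partition.
    groups = defaultdict(list)
    for row in rows:
        groups[row[group_by]].append(row)
    out = []
    for members in groups.values():
        out.extend(sorted(members, key=lambda r: r[by], reverse=True)[:k])
    return out


rows = [
    {"a": "x", "b": 1},
    {"a": "x", "b": 5},
    {"a": "y", "b": 3},
    {"a": "y", "b": 4},
]

overall = top_k(rows, 1, by="b")                   # largest 'b' overall
per_group = top_k(rows, 1, by="b", group_by="a")   # largest 'b' within each 'a'
```

With a single `by` parameter there would be no way to express the second call.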

For readability, looking at the tests, I do think

out = df.rolling("times", period="5i", group_by=["groups"])

is clearer to read than

out = df.rolling("times", period="5i", by=["groups"])

The latter almost looks like it's rolling based on `groups`, whereas it's rolling based on `times`, grouped by `groups`.


codecov · 2024-03-04T16:17:16Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 80.94%. Comparing base (baacf3d) to head (8b13ee1).
Report is 147 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main   #14840      +/-   ##
==========================================
- Coverage   80.94%   80.94%   -0.01%     
==========================================
  Files        1327     1327              
  Lines      172081   172102      +21     
  Branches     2453     2453              
==========================================
+ Hits       139290   139305      +15     
- Misses      32320    32327       +7     
+ Partials      471      470       -1


stinodego

Thanks Marco, good move 👍

depr: rename by to group_by in dataframe.upsample, dataframe.group_by_dynamic, and dataframe.rolling (8b13ee1)

github-actions bot added the labels deprecation (Add a deprecation warning to outdated functionality), python (Related to Python Polars), and rust (Related to Rust Polars) Mar 4, 2024

MarcoGorelli marked this pull request as ready for review March 4, 2024 15:29

MarcoGorelli requested review from ritchie46, stinodego, c-peters, alexander-beedie and orlp as code owners March 4, 2024 15:29

MarcoGorelli mentioned this pull request Mar 8, 2024

Inconsistent by meaning in rolling_* and group_by_rolling - rename? #10989

Closed

stinodego changed the title ~~depr: rename by to group_by in dataframe.upsample, dataframe.group_by_dynamic, and dataframe.rolling~~ depr(python,rust!): rename by to group_by in dataframe.upsample, dataframe.group_by_dynamic, and dataframe.rolling Mar 21, 2024

github-actions bot added the breaking rust label (Change that breaks backwards compatibility for the Rust crate) Mar 21, 2024

stinodego changed the title ~~depr(python,rust!): rename by to group_by in dataframe.upsample, dataframe.group_by_dynamic, and dataframe.rolling~~ depr(python,rust!): Rename parameter by to group_by in DataFrame.upsample/group_by_dynamic/rolling Mar 21, 2024

stinodego approved these changes Mar 21, 2024


stinodego merged commit d7339ad into pola-rs:main Mar 21, 2024
31 checks passed

henryharbeck mentioned this pull request Apr 7, 2024

docs(python): Update leftover references of by parameter to group_by in DataFrame/LazyFrame.upsample/group_by_dynamic/rolling #15527

Merged
