Version 2.1.0: Caching, and much more
We have a lot of new stuff in this release.
-
Caching
- We now cache most of the time expensive scraping functions. In addition, we have a consistent storage location for the Lahman Databank, so no more five copies of baseball-databank. This is currently disabled by default, you can enable it (see caching in the docs).
- Basically, now when you call something like
batting_stats(2018)
, for a while subsequent calls will just hit the cache, if it is enabled, so you don't need to do a bunch of pd.load and df.to_csv all over your notebooks. - Right now this is a week by default, which is configurable in code, and will probably change soon to more specific per-function staleness. @TheCleric, @schorrm
-
FanGraphs fielding - @TheCleric
-
Massively expanded options for FanGraphs scraping - @TheCleric
-
Marcel projections - @bdilday
-
Batted ball trajectories - @bdilday
-
Baseball Reference splits - @mwisnie5
-
Add spray angle to statcast dataframes - @tjburch
-
Flag imputed data (where Trackman didn't do it, stringers did) in statcast batting - @tjburch
- See this piece in The Hardball Times for more info.
-
Plot batted ball profile - @tjburch
-
Improvements to data type inference - @TheCleric
-
Unit testing, by @TheCleric
-
And various bugfixes, with thanks to @bdilday, @bgunn34, and @TheCleric.