Skip to content

Version 2.1.0: Caching, and much more

Compare
Choose a tag to compare
@schorrm schorrm released this 14 Oct 18:16
· 93 commits to master since this release

We have a lot of new stuff in this release.

  • Caching

    • We now cache most of the time expensive scraping functions. In addition, we have a consistent storage location for the Lahman Databank, so no more five copies of baseball-databank. This is currently disabled by default, you can enable it (see caching in the docs).
    • Basically, now when you call something like batting_stats(2018), for a while subsequent calls will just hit the cache, if it is enabled, so you don't need to do a bunch of pd.load and df.to_csv all over your notebooks.
    • Right now this is a week by default, which is configurable in code, and will probably change soon to more specific per-function staleness. @TheCleric, @schorrm
  • FanGraphs fielding - @TheCleric

  • Massively expanded options for FanGraphs scraping - @TheCleric

  • Marcel projections - @bdilday

  • Batted ball trajectories - @bdilday

  • Baseball Reference splits - @mwisnie5

  • Add spray angle to statcast dataframes - @tjburch

  • Flag imputed data (where Trackman didn't do it, stringers did) in statcast batting - @tjburch

  • Plot batted ball profile - @tjburch

  • Improvements to data type inference - @TheCleric

  • Unit testing, by @TheCleric

  • And various bugfixes, with thanks to @bdilday, @bgunn34, and @TheCleric.