Skip to content

peteb206/canadians-in-college-baseball

Repository files navigation

canadians-in-college-baseball

Python web scraper for the Canadian Baseball Network... view full project description

scrape-players update-players-sheet

  • Scan NCAA, NJCAA, NAIA, etc. schools' baseball rosters for players whose hometown references Canada or a Canadian city or province.
  • Clean and format data due to differences in each school's website formats.
  • Export results to Google Sheets and display using gspread package.

scrape-stats update-stats-sheet

  • Locate the season statistics of the players found by the Canadians in College Baseball scraper.
  • Clean and format the data found from the NCAA, NJCAA, NAIA, etc. websites.
  • Export results to Google Sheets and display using gspread package.

Python (3.11.2) Packages

unit-tests

Web Scraping

  • requests (2.28.1)
  • beautifulsoup4 (4.11.1)
  • lxml (4.9.1)
  • html5lib (1.1)
  • json (built-in)

Data Manipulation

  • pandas (1.4.4)
  • re (built-in)

Google Sheets API

  • gspread (5.7.1)
  • oauth2client (4.1.3)