Skip to content

eliasbnk/web-scraping-sandbox

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Scrapethissite.com - Web Scraping Exercises

This repository contains Python scripts of my attempt at web scraping exercises found on Scrapethissite.com.

Exercises:

  1. Hockey Teams: Forms, Searching and Pagination: Scrape NHL team stats from a page with pagination and tables.

  2. Oscar Winning Films: AJAX and Javascript: Scrape award-winning films information from a page with asynchronous content loading.

  3. Countries of the World: A Simple Example: Scrape country information from a single page.

Installation:

  1. Clone this repository:

    git clone git@github.com:eliasbnk/web-scraping-sandbox.git
  2. Navigate to the project directory:

    cd web-scraping-sandbox
  3. Create a virtual environment:

    python -m venv venv
  4. Activate the virtual environment:

    • Windows:
    venv\Scripts\activate
    • Mac/Linux:
    source venv/bin/activate
  5. Install dependencies:

    pip install -r requirements.txt

Usage:

Run scripts to complete exercises. For example:

python teams.py

License:

MIT License. See LICENSE.

Releases

No releases published

Packages

No packages published

Languages