Skip to content

A web scraper that grabs detailed property data from thelandbank.org and saves it to a SQL db

Notifications You must be signed in to change notification settings

d-towns/flispi_scrapy

Repository files navigation

Flispi Scrapy

Description

This is a simple scrapy project that scrapes the Genesee County Land Bank website and saves the data to a postgres database.

Due to the nature of the website, the scraper had to be written using two differnt spiders. The first spider scrapes the list of properties and the second spider scrapes the details of each property.

Installation

  1. Clone the repository
  2. Install the requirements
  3. Create a postgres database
  4. Create a .env file in the root directory of the project and add the following variables:
    • PROD_POSTGRESS_URL
    • DEV_POSTGRESS_URL
    • GOOGLE_API_KEY
    • ENV
  5. Start your virtual environment
  6. Run the following command to start the scraper:
    • scrapy crawl landbank_spider

About

A web scraper that grabs detailed property data from thelandbank.org and saves it to a SQL db

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages