car-scraper

Web-scraping script that extracts data from a given url aquired by running a search query on 'https://www.otomoto.pl'. After providing the script with a link, the script will scrape the given page and then look for the next page in the html code. The loop runs for each consecutive page until the next url is nowhere to be found (the container returns NoneType). After that, the script saves all the scraped data to a .csv file which is then used to create data structures necessary to plot charts using pygal. The repository contains three main .py files. The main file is 'scrap.py', which imports user-defined functions from 'functions.py' and 'graph.py'.

Installation

Run:

python3 -m venv venv
source venv/bin/activate
pip3 install -r requirements.txt

Done. You can now run python3 scrap.py and let the scraper work.

Requirements

All the packages required to run the script properly are defined in the requirements.txt file.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
.gitignore		.gitignore
README.md		README.md
classes.py		classes.py
functions.py		functions.py
graph.py		graph.py
requirements.txt		requirements.txt
scrap.py		scrap.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

car-scraper

Installation

Requirements

Sample urls

About

Releases

Packages

Languages

mateuszgrzybek/car-scraper

Folders and files

Latest commit

History

Repository files navigation

car-scraper

Installation

Requirements

Sample urls

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages