Project contains source code for scraping two websites containing information about lawyers. Code uses Google Chrome
driver and is made to run on Windows 10 OS
.
Python version 3.8
To run the code locally:
- IMP: You would need to install the web driver for your browser for selenium to work
- Clone the repository
cd
in theAdvocate
folder- Create a virtual environment using
venv
orconda
- Activate your environment and do
pip install -r requirements.txt
- Change your driver path in the files
- Line 60, 27:
driver = webdriver.Chrome("D:\\your\\path\\to\\driver\\here\\chromedriver.exe")
cd
in the respective directory andpython <script_name>_scrape.py
https://www.cyprusbar.org/CypriotAdvocateMembersPage.aspx
cd cyprus
python cyprus_scrape.py
cd davac
python davac_scrape.py
NOTE: The scraping would be slow as there is a lot of intialisation for this website. So don't close the program and keep your patience. It would work. Only close if there is some error displayed :D