-
Notifications
You must be signed in to change notification settings - Fork 76
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
1 changed file
with
33 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,33 @@ | ||
![web scraping with python](https://github.com/rajat4665/web-scraping-with-python/blob/master/WEB%20SCRAPING.jpg) | ||
<br> | ||
<span style="text-decoration: underline;"><strong>Introduction:</strong></span> | ||
|
||
<b>Web scraping</b>, <b>web harvesting</b>, or <b>web data extraction</b> is data scraping used for extracting data from websites using its HTML structure, In this post, I will explain basic fundaments of web scraping using python and also explore it by a live demonstration with two python libraries Beautifulsoup and requests respectively. | ||
|
||
<span style="text-decoration: underline;"><strong>What you will learn from this post:</strong></span> | ||
<ul> | ||
<li>basic understanding of web scraping</li> | ||
<li>how to extract data from a website using classes and HTML tags</li> | ||
<li>how to use requests module to get data</li> | ||
<li>how to use Beautifulsoup</li> | ||
</ul> | ||
<span style="text-decoration: underline;"><strong>Requirements:</strong></span> | ||
<ul> | ||
<li>python3</li> | ||
<li>requests</li> | ||
<li>bs4</li> | ||
</ul> | ||
<span style="text-decoration: underline;"><strong>Installation:</strong></span> | ||
<ul> | ||
<li>sudo apt-get python3-pip</li> | ||
<li>pip3 install requests</li> | ||
<li>pip3 install bs4</li> | ||
</ul> | ||
|
||
<h2> How to run this code</h2> | ||
<ul> | ||
<li>there are two source code files, one is .py extention and another is .ipynb extention</li> | ||
<li>one can run Scraping with BeautifulSoup.py file in python by run this cammand in terminal "python3 Web Scraping with BeautifulSoup.py"</li> | ||
<li>one can run Scraping with BeautifulSoup.ipynb file in jupyter notebook /li> | ||
<li>one can install juypyter notebook by this command "pip3 install jupyter"</li> | ||
</ul> |