A web scraping tool that scrapes discussions from GitHub based on a specified search query.

shubhamp1/GitHub_discussion_scraper

This Ruby script is a web scraping tool. It scrapes discussions from GitHub that match a specified search query, extracts each discussion's content and associated comments, and writes the data to a CSV file for further analysis or storage.
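To make the CSV layout concrete, here is a minimal sketch of how scraped discussions and their comments could be flattened into rows. The `Discussion` struct, the column names, and the `discussions_to_csv` helper are illustrative assumptions. They are not taken from github_comments.rb, whose actual fields may differ.

```ruby
require "csv"

# Hypothetical record shape; the real script's fields may differ.
Discussion = Struct.new(:title, :url, :body, :comments)

# Flatten each discussion into one CSV row per comment, repeating the
# discussion fields so every row can be read on its own.
def discussions_to_csv(discussions)
  CSV.generate do |csv|
    csv << %w[title url body comment]
    discussions.each do |d|
      # A discussion without comments still gets one row (empty comment cell).
      comments = d.comments.empty? ? [nil] : d.comments
      comments.each { |c| csv << [d.title, d.url, d.body, c] }
    end
  end
end

# Made-up sample data, for illustration only.
sample = [
  Discussion.new("How to paginate?",
                 "https://github.com/example/repo/discussions/1",
                 "Question body",
                 ["First reply", "Second reply"])
]
puts discussions_to_csv(sample)
```

One row per comment keeps the file easy to filter in a spreadsheet, at the cost of repeating the discussion fields on every row.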

To use this script, follow these steps:

  1. Save the provided Ruby script into a file named github_comments.rb.
  2. Ensure you have Ruby installed on your system.
  3. Open a terminal or command prompt and navigate to the directory where github_comments.rb is saved.
  4. Run the script using the command ruby github_comments.rb.
  5. The script will prompt you to enter a comment word. Provide the desired keyword for searching discussions on GitHub.
  6. The script will start crawling GitHub discussions based on the provided keyword, scrape the content, and store the data in a CSV file named with a timestamp and the search query.
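Step 6 says the CSV file is named with a timestamp and the search query. The sketch below shows one way such a name could be built. The `csv_filename` helper, the timestamp format, and the sanitizing rule are assumptions for illustration; the script's actual naming pattern is not documented here.

```ruby
# Build a CSV file name from a timestamp and the search query.
# The naming pattern is an assumption, not the script's actual format.
def csv_filename(query, now = Time.now)
  # Replace each run of characters that are not letters, digits,
  # dashes, or underscores with a single underscore.
  safe_query = query.strip.gsub(/[^A-Za-z0-9_-]+/, "_")
  "#{now.strftime('%Y%m%d_%H%M%S')}_#{safe_query}.csv"
end

puts csv_filename("rate limit", Time.new(2024, 1, 2, 3, 4, 5))
# prints "20240102_030405_rate_limit.csv"
```

Sanitizing the query keeps spaces and path separators out of the file name, so any keyword yields a name that is safe to write to disk.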

Steps to Clone the Repository
(Run these commands in a terminal)

  $ git clone https://github.com/shubhamp1/GitHub_discussion_scraper.git
  $ cd GitHub_discussion_scraper
  $ ruby github_comments.rb 
