Crawling Educational Websites


Aakash Chandhoke
Motilal Nehru National Institute of Technology, Allahabad
aakash.chandhoke24@gmail.com


Akhilesh Kumar
Motilal Nehru National Institute of Technology, Allahabad
akhileshkumarhbti@gmail.com


Shubham Kumar
Motilal Nehru National Institute of Technology, Allahabad
shubham.kumar.sci@gmail.com



SpiderUnleashed is a web crawler that retrieves non-PDF results from educational websites, since PDFs are already easy to find through Google Scholar. The application uses the HTTrack library to crawl web pages related to the topic provided to the application.
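
The "non-PDF results from educational websites" rule can be pictured as a simple URL filter. The snippet below is only a minimal sketch under stated assumptions, not the project's actual code: the function names (`is_educational`, `is_pdf`, `keep_url`) are hypothetical, and the test for an "educational" host (a `.edu` top-level domain, or an `ac`/`edu` second-level label such as `.ac.in`) is a rough heuristic chosen for illustration. It does not call HTTrack.

```python
from urllib.parse import urlparse

# Assumption: second-level labels that commonly mark academic domains,
# e.g. "mnnit.ac.in" or "example.edu.au". This is a heuristic, not a standard.
EDU_SECOND_LEVEL = ("ac", "edu")


def is_educational(url):
    """Return True if the host looks like an educational domain (heuristic)."""
    host = (urlparse(url).hostname or "").lower()
    labels = host.split(".")
    if labels[-1] == "edu":
        return True
    # Country-code style domains: <name>.ac.<cc> or <name>.edu.<cc>
    return len(labels) >= 3 and labels[-2] in EDU_SECOND_LEVEL


def is_pdf(url):
    """Return True if the URL path ends in .pdf (case-insensitive)."""
    return urlparse(url).path.lower().endswith(".pdf")


def keep_url(url):
    """Keep only non-PDF pages hosted on educational domains."""
    return is_educational(url) and not is_pdf(url)


if __name__ == "__main__":
    candidates = [
        "https://www.mnnit.ac.in/courses/index.html",
        "https://www.mnnit.ac.in/notes/lecture1.pdf",
        "https://example.com/page.html",
    ]
    for url in candidates:
        print(url, keep_url(url))
```

A filter based on the file extension misses PDFs served from URLs without a `.pdf` suffix; checking the `Content-Type` response header would be a more reliable, though slower, alternative.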