Skip to content

SpiderUnleashed is a webcrawler that retrieves the non-pdf results from educational websites because pdfs are easily available on Google Scholar. The application uses the inbuilt library of HTTrack that crawls the webpages of the provided topic to the application.

Notifications You must be signed in to change notification settings

aakashchandhoke/SpiderUnleashed

Repository files navigation

Crawling Educational Websites


Aakash Chandhoke
Motilal Nehru National Institute of Technology, Allahabad
aakash.chandhoke24@gmail.com


Akhilesh Kumar
Motilal Nehru National Institute of Technology, Allahabad
akhileshkumarhbti@gmail.com


Shubham Kumar
Motilal Nehru National Institute of Technology, Allahabad
shubham.kumar.sci@gmail.com



SpiderUnleashed is a webcrawler that retrieves the non-pdf results from educational websites because pdfs are easily available on Google Scholar. The application uses the inbuilt library of HTTrack that crawls the webpages of the provided topic to the application.

About

SpiderUnleashed is a webcrawler that retrieves the non-pdf results from educational websites because pdfs are easily available on Google Scholar. The application uses the inbuilt library of HTTrack that crawls the webpages of the provided topic to the application.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages