Skip to content

Latest commit

 

History

History
15 lines (11 loc) · 599 Bytes

README.md

File metadata and controls

15 lines (11 loc) · 599 Bytes

tor-spider

A spider for Hidden Services that uses the excellent colly library to collect web pages.

The default implementation uses Elastic Search to save the pages, mongoDB to store the URLs to be crawled and redis to check if a URL has ben visited or not.

You can use Docker to run it, just build the image with make docker and run the containers with make compose. After it's running, you can add a url to start from with curl http://localhost:8080?url=<URL>

Everything is a WIP and there are a lot of things that needs to be fixed or implemented.