Allow creating additional workers for snapshot page downloading to achieve a specific download speed #7

Open
JakeYallop opened this issue May 18, 2024 · 0 comments

Comments

@JakeYallop
Owner

Web page downloading has a variable rate limit, and new workers are created to reach the desired limit. Downloading lists of CDX records (snapshot pages) can take several seconds, so with a high enough web page download rate limit, snapshot page downloading could become the limiting factor.

We should try to rewrite the snapshot downloading so that it can also be parallelised.
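
As a starting point for discussion, here is a rough sketch of one way the snapshot page downloads could be parallelised: a fixed number of workers pull page numbers from a shared counter, and results are stitched back together in page order. It is written in Python purely for illustration and is not based on the project's code. `fetch_snapshot_pages`, `fetch_page`, `page_count` and `max_workers` are all hypothetical names, and it assumes the number of snapshot pages is known before downloading starts.

```python
import asyncio
from typing import Awaitable, Callable, List, Optional


async def fetch_snapshot_pages(
    fetch_page: Callable[[int], Awaitable[List[str]]],  # hypothetical: downloads one page of CDX records
    page_count: int,   # assumption: total number of pages is known up front
    max_workers: int,  # upper bound on concurrent page downloads
) -> List[str]:
    """Download pages 0..page_count-1 with at most max_workers in flight.

    Records are returned in page order, regardless of which download finishes first.
    """
    if max_workers < 1:
        raise ValueError("max_workers must be at least 1")

    results: List[Optional[List[str]]] = [None] * page_count
    next_page = 0

    async def worker() -> None:
        nonlocal next_page
        # Claiming a page has no await between the read and the increment,
        # so two workers on the same event loop never claim the same page.
        while next_page < page_count:
            page = next_page
            next_page += 1
            results[page] = await fetch_page(page)

    worker_count = min(max_workers, page_count)
    await asyncio.gather(*(worker() for _ in range(worker_count)))

    # Flatten the per-page lists into one list of records.
    return [record for page in results if page is not None for record in page]


if __name__ == "__main__":
    async def fake_fetch(page: int) -> List[str]:
        # Stand-in for a real network call that takes a while to complete.
        await asyncio.sleep(0.1)
        return [f"page{page}-record{i}" for i in range(2)]

    print(asyncio.run(fetch_snapshot_pages(fake_fetch, page_count=4, max_workers=2)))
```

In this sketch `max_workers` is fixed for the whole run. Reaching a specific download speed, as the title suggests, would need the worker count to be adjusted while downloads are in progress, which is not shown here.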
