w3lib

Overview

This is a Python library of web-related functions, such as:

remove comments, or tags from HTML snippets
extract base url from HTML snippets
translate entites on HTML strings
convert raw HTTP headers to dicts and vice-versa
construct HTTP auth header
converting HTML pages to unicode
sanitize urls (like browsers do)
extract arguments from urls

Requirements

Python 3.9+

Install

pip install w3lib

Documentation

See http://w3lib.readthedocs.org/

License

The w3lib library is licensed under the BSD license.

Name		Name	Last commit message	Last commit date
Latest commit History 505 Commits
.github/workflows		.github/workflows
docs		docs
tests		tests
w3lib		w3lib
.bandit.yml		.bandit.yml
.bumpversion.cfg		.bumpversion.cfg
.coveragerc		.coveragerc
.flake8		.flake8
.git-blame-ignore-revs		.git-blame-ignore-revs
.gitignore		.gitignore
.isort.cfg		.isort.cfg
.pre-commit-config.yaml		.pre-commit-config.yaml
.readthedocs.yml		.readthedocs.yml
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
NEWS		NEWS
README.rst		README.rst
codecov.yml		codecov.yml
conftest.py		conftest.py
mypy.ini		mypy.ini
pylintrc		pylintrc
pytest.ini		pytest.ini
setup.py		setup.py
tox.ini		tox.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

w3lib

Overview

Requirements

Install

Documentation

License

About

Releases 11

Packages

Contributors 43

Languages

License

scrapy/w3lib

Folders and files

Latest commit

History

Repository files navigation

w3lib

Overview

Requirements

Install

Documentation

License

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases 11

Packages 0

Contributors 43

Languages

Packages