These examples show how to work with PDF document in Python
- Extract images from a pdf Use pyMuPDF to extract imges from all pages
- Extract metadata from a pdf Use pyMuPDF to extract metadata from all pages
- Extract pages from a pdf Use pyMuPDF to extract all pages and render the contents as an image
- Extract text from a pdf Use pyMuPDF to extract text from all pages
python3 extract_images.py --input <path to pdf> --output ./output-images
python3 extract_metadata.py --input <path to pdf>
python3 extract_pages.py --input <path to pdf> --output ./output --dpi 300
python3 extract_test.py --input <path to pdf> --output ./output