pymupdf
Here are 79 public repositories matching this topic...
Integracion LLamaIndex with NVIDIA NIM
-
Updated
Nov 10, 2024 - Jupyter Notebook
A Python-based tool that converts PDF files into editable Word documents, preserving text, images, and layout. Uses PyPDF2, PyMuPDF (fitz), python-docx, and Pillow to accurately transfer content from PDF to .docx. Ideal for transforming complex PDFs into Word format for easy editing.
-
Updated
Nov 8, 2024 - Python
Freeplane script to organise highlighted text and notes from pdf files as Freeplane mindmap
-
Updated
Nov 8, 2024 - Tcl
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
-
Updated
Nov 11, 2024 - Python
A Python application that extracts text and images from PDFs, applies OCR to images using Tesseract, and stores the results in a SQLite database. The application features a GUI for searching both text and OCR-extracted content and previewing PDF files.
-
Updated
Nov 1, 2024 - Python
A Python tool to compress PDF files by downscaling and compressing embedded images. Uses PyMuPDF and Pillow to optimize file size, making PDFs easier to store and share while preserving layout and quality.
-
Updated
Oct 30, 2024 - Jupyter Notebook
ConversAI is an innovative conversational AI framework designed for intelligent text extraction and querying across various document formats and web content, leveraging advanced natural language processing techniques.
-
Updated
Oct 16, 2024 - Python
Multimodal LLM Application with PyMuPDF4LLM
-
Updated
Oct 4, 2024 - Jupyter Notebook
a miniplayer for pdf documents
-
Updated
Sep 25, 2024 - Python
Open source Python library for converting PDF to DOCX.
-
Updated
Sep 23, 2024 - Python
Generates an Acronym List for your PDF quickly and locally for over 200 pages of text
-
Updated
Sep 18, 2024 - Python
A PDF manipulation and access application developed in Python using the PyMuPDF and CustomTkinter modules.
-
Updated
Sep 7, 2024 - Python
An automated system for a health insurance company to streamline document processing, including template matching and fraud detection, resulting in reduction of processing time.
-
Updated
Sep 3, 2024 - Python
A Streamlit-based chatbot application utilizing Groq API and Langchain for conversational AI.
-
Updated
Sep 2, 2024 - Python
Improve this page
Add a description, image, and links to the pymupdf topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the pymupdf topic, visit your repo's landing page and select "manage topics."