pymupdf-fitz

Here are 42 public repositories matching this topic...

kalyaninagaraj / NFHS5

Python code to read, retrieve, analyze, and plot district-level findings from official (pdf) publications of the 5th National Family Health Survey of India

principal-component-analysis kmeans-clustering geopandas matplotlib-pyplot tutorial-python nfhs5 pymupdf-fitz

Updated May 24, 2022
Jupyter Notebook

bilalhameed248 / PDF-Document-Extraction

Star

Python PDF-to-HTML Converter: Transforming PDF Documents into Structured HTML Tags. - Feb 2022 - Jun 2023

python pdf parser parsing extraction python3 document fitz pymupdf pymupdf-fitz

Updated Nov 5, 2023
Python

RomyJr / Retrocession_Detector

Star

This application facilitates the comparison of two PDF files. Differences are presented in a table, color-coded as red (deletions), green (additions), and orange (moved text). Users can save the results in Excel format. It is designed to check whether annotations have been taken into account during the comparison process.

pyqt5 openpyxl pypdf2 difflib pandas-library fitz pymupdf pymupdf-fitz

Updated Nov 17, 2023
Python

RomyJr / PDF_TXT_Word_research

Star

This application simplifies PDF keyword searches, allowing users to easily find specific terms in files or folders. Results are displayed clearly, and the history feature enables quick review and filtering of past searches. Users can click on document links in the history to open them directly in the default PDF viewer.

pyqt5 pymupdf pymupdf-fitz

Updated Nov 22, 2023
Python

helgesander02 / TKFruitMG

Star

An ERP system that uses customtkinter as the GUI base, with a postgreSQL database and reportlab, win32print, and pymupdf-fitz design.

postgresql reportlab customtkinter pymupdf-fitz win32print

Updated Dec 5, 2023
Python

ashutosh6500 / Resume-Parser-AWS-Event-Driven-Workflow

Star

This is simple event driven mini project based on different AWS services like Lambda,EC2,Dynamodb,S3,SNS etc

aws resume-parser event-driven-architecture lambda-layers pymupdf-fitz aws-projects

Updated Jan 8, 2024
Python

Sazizi2025 / PDF-Founder

Star

Are you short on time?! Can't you search all the PDFs one by one for the content you want?! Well, PDF-Founder is here...

python pdf gui image tesseract rgb graphical tesseract-ocr easy-to-use image-generator snipping pdf-search-engine pymupdf pysimplegui pdf-search ptl pymupdf-fitz

Updated Jan 8, 2024
Python

mcagriaksoy / diff_merge_pdf

Star

A tool for compare, merge, display difference and make OCR between the PDFs.

pdf-viewer pdf-generator pdf-merger ocr-recognition pdf-comparison x-ray-images ocr-text-reader diff-tool pdf-document-processor pdf-ocr-extraction pyqt6-desktop-application pymupdf-fitz pdf-ocr pdf-visual-testing diff-tool-pdf

Updated Jan 21, 2024
Python

vickypandey14 / Convert-PDF-into-Image-By-Python

Star

This Python script converts each page of a PDF document into separate image files. It utilizes the PyMuPDF library (fitz) to handle PDF operations and the Python Imaging Library (PIL) for image processing.

python python-script pdf-converter pymupdf pymupdf-fitz

Updated Feb 22, 2024
Python

raju-2003 / KSP-DATATHON-24

Star

Data Privacy in Law Enforcement - KSP DATATHON - 2024 - FIR Redactor

mongodb python3 jwt-authentication pdfreader streamlit-webapp openai-api pymupdf-fitz

Updated May 21, 2024
Python

IglesiasT / comparador-pdfs

Star

python pdf-comparison pymupdf-fitz

Updated Aug 7, 2024
Python

OtenMoten / pdf-alchemist

Star

It's designed for transmuting PDFs into HTML. Harness the power of OCR, image processing, and web technologies to unlock the secrets within your PDF documents.

python pdf-converter pillow tesseract-ocr beautifulsoup4 pdf-document-processor dominate pymupdf-fitz tdqm

Updated Aug 9, 2024
Python

devbm7 / QGen

Star

Question Generator System

nlp json ml transformers pandas python3 pytorch spacy wikipedia-api nltk smtp regular-expressions streamlit pymupdf-fitz t5-large

Updated Oct 21, 2024
Python

das-amlan / PDF_Image_Extractor_Web_App

Star

This is a simple web app that allows users to upload a PDF file, extract images from the PDF, and display the images in the web app.

python html flask fitz streamlit pymupdf-fitz

Updated Dec 1, 2024
Python

FrancisLauriano / chatsoftex

Star

Plataforma desenvolvida em Python que visa automatizar e agilizar o processo de avaliação de projetos de inovação tecnológica, utilizando inteligência artificial e critérios padronizados com base na Lei do Bem.

Updated Dec 18, 2024
Python

ParthaPRay / pdf_text_extraction_json_section_subsection

Star

This repo contains codes for extraction of PDF text to JSON to show section number, section title, section body content, footnote

pdf json text regex extraction document article-extractor pymupdf-fitz

Updated Dec 23, 2024
Python

atthharvva / PDF-Form-Reader

Star

This Python script extracts information from PDF forms using OCR (Optical Character Recognition) and saves the extracted data into an Excel file. It is particularly designed for processing forms with checkboxes and textual fields. The script can handle variations in form structure and allows for easy customization to accommodate other PDF form type

python forms pillow pdf-forms openpyxl csv-export ocr-text-reader pdf-document-processor pymupdf-fitz graphical-checkboxes

Updated Jan 9, 2025
Python

Kurama-90 / GUI-PDF-to-Excel

Star

PyQt5-based GUI application that allows users to convert PDF files into Excel files. The application provides multiple options for extracting data from PDFs, including tables, text, and OCR (Optical Character Recognition).

python pdf data gui numpy excel pyqt5 pandas poppler opencv-python pdf2image pdfplumber easyocr pymupdf-fitz

Updated Feb 23, 2025
Python

ifte110 / Serach_all_pdfs_by_string

Star

Search through all pdf files in a folder for a specific keyword or string of keywords.

python pymupdf-fitz pdfsearchtool

Updated Feb 27, 2025
Python

Jatin-s16 / Resume-check-portal-for-candidates

Star

A Streamlit-based application that enables job seekers to evaluate and enhance their resumes by analyzing alignment with specific job descriptions, providing actionable insights for improvement.

python nlp regex cosine-similarity spacy-nlp streamlit sentence-transformers pymupdf-fitz

Updated Apr 8, 2025
Jupyter Notebook

Improve this page

Add a description, image, and links to the pymupdf-fitz topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the pymupdf-fitz topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pymupdf-fitz

Here are 42 public repositories matching this topic...

kalyaninagaraj / NFHS5

bilalhameed248 / PDF-Document-Extraction

RomyJr / Retrocession_Detector

RomyJr / PDF_TXT_Word_research

helgesander02 / TKFruitMG

ashutosh6500 / Resume-Parser-AWS-Event-Driven-Workflow

Sazizi2025 / PDF-Founder

mcagriaksoy / diff_merge_pdf

vickypandey14 / Convert-PDF-into-Image-By-Python

raju-2003 / KSP-DATATHON-24

IglesiasT / comparador-pdfs

OtenMoten / pdf-alchemist

devbm7 / QGen

das-amlan / PDF_Image_Extractor_Web_App

FrancisLauriano / chatsoftex

ParthaPRay / pdf_text_extraction_json_section_subsection

atthharvva / PDF-Form-Reader

Kurama-90 / GUI-PDF-to-Excel

ifte110 / Serach_all_pdfs_by_string

Jatin-s16 / Resume-check-portal-for-candidates

Improve this page

Add this topic to your repo