Python code to read, retrieve, analyze, and plot district-level findings from official (pdf) publications of the 5th National Family Health Survey of India
-
Updated
May 24, 2022 - Jupyter Notebook
Python code to read, retrieve, analyze, and plot district-level findings from official (pdf) publications of the 5th National Family Health Survey of India
Python PDF-to-HTML Converter: Transforming PDF Documents into Structured HTML Tags. - Feb 2022 - Jun 2023
This application facilitates the comparison of two PDF files. Differences are presented in a table, color-coded as red (deletions), green (additions), and orange (moved text). Users can save the results in Excel format. It is designed to check whether annotations have been taken into account during the comparison process.
This application simplifies PDF keyword searches, allowing users to easily find specific terms in files or folders. Results are displayed clearly, and the history feature enables quick review and filtering of past searches. Users can click on document links in the history to open them directly in the default PDF viewer.
An ERP system that uses customtkinter as the GUI base, with a postgreSQL database and reportlab, win32print, and pymupdf-fitz design.
This is simple event driven mini project based on different AWS services like Lambda,EC2,Dynamodb,S3,SNS etc
Are you short on time?! Can't you search all the PDFs one by one for the content you want?! Well, PDF-Founder is here...
A tool for compare, merge, display difference and make OCR between the PDFs.
This Python script converts each page of a PDF document into separate image files. It utilizes the PyMuPDF library (fitz) to handle PDF operations and the Python Imaging Library (PIL) for image processing.
Data Privacy in Law Enforcement - KSP DATATHON - 2024 - FIR Redactor
It's designed for transmuting PDFs into HTML. Harness the power of OCR, image processing, and web technologies to unlock the secrets within your PDF documents.
Question Generator System
Plataforma desenvolvida em Python que visa automatizar e agilizar o processo de avaliação de projetos de inovação tecnológica, utilizando inteligência artificial e critérios padronizados com base na Lei do Bem.
This repo contains codes for extraction of PDF text to JSON to show section number, section title, section body content, footnote
This Python script extracts information from PDF forms using OCR (Optical Character Recognition) and saves the extracted data into an Excel file. It is particularly designed for processing forms with checkboxes and textual fields. The script can handle variations in form structure and allows for easy customization to accommodate other PDF form type
PyQt5-based GUI application that allows users to convert PDF files into Excel files. The application provides multiple options for extracting data from PDFs, including tables, text, and OCR (Optical Character Recognition).
Search through all pdf files in a folder for a specific keyword or string of keywords.
A Streamlit-based application that enables job seekers to evaluate and enhance their resumes by analyzing alignment with specific job descriptions, providing actionable insights for improvement.
Add a description, image, and links to the pymupdf-fitz topic page so that developers can more easily learn about it.
To associate your repository with the pymupdf-fitz topic, visit your repo's landing page and select "manage topics."