-
pdfium-binaries Public
Forked from bblanchon/pdfium-binaries📰 Binary distribution of PDFium
Shell UpdatedDec 20, 2025 -
PaddleX Public
Forked from PaddlePaddle/PaddleXAll-in-One Development Tool based on PaddlePaddle
Python Apache License 2.0 UpdatedNov 5, 2025 -
binary_staging Public
Stash for compiled auxiliary binaries. Check the Releases area.
UpdatedOct 31, 2025 -
cibuildwheel Public
Forked from pypa/cibuildwheel🎡 Build Python wheels for all the platforms with minimal configuration.
Python Other UpdatedAug 16, 2025 -
clipper Public
Forked from zyedidia/clipperCross-platform clipboard access in Go
Go MIT License UpdatedMay 31, 2025 -
ctypesgen Public
Forked from ctypesgen/ctypesgenPure-python wrapper generator for ctypes
Python BSD 2-Clause "Simplified" License UpdatedMay 7, 2025 -
pdfplumber Public
Forked from jsvine/pdfplumberPlumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
Python MIT License UpdatedMay 2, 2025 -
libpdfium Public
Forked from tiran/libpdfiumRPM for Google PDFium / libpdfium.so
Shell UpdatedApr 9, 2025 -
nv-ingest Public
Forked from NVIDIA/NeMo-RetrieverNVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retri…
Python Apache License 2.0 UpdatedMar 6, 2025 -
PyMuPDF Public
Forked from pymupdf/PyMuPDFPyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
Python GNU Affero General Public License v3.0 UpdatedFeb 18, 2025 -
docling Public
Forked from docling-project/doclingDocling bundles PDF document conversion to JSON and Markdown in an easy, self-contained package.
Python MIT License UpdatedJul 23, 2024 -
pikepdf Public
Forked from pikepdf/pikepdfA Python library for reading and writing PDF, powered by qpdf
Python Mozilla Public License 2.0 UpdatedJul 4, 2024 -
pdftext Public
Forked from datalab-to/pdftextExtract structured text from pdfs quickly
Python Apache License 2.0 UpdatedApr 30, 2024 -
spek Public
Forked from alexkay/spekAcoustic spectrum analyser
C++ GNU General Public License v3.0 UpdatedJan 16, 2024 -
cpython Public
Forked from python/cpythonThe Python programming language
Python Other UpdatedJan 4, 2024 -
yt-dlp Public
Forked from yt-dlp/yt-dlpA youtube-dl fork with additional features and fixes
Python The Unlicense UpdatedDec 31, 2023 -
-
test_workflows Public
Personal experiments with GH workflows
BSD 3-Clause "New" or "Revised" License UpdatedNov 21, 2023 -
JSPyBridge Public
Forked from extremeheat/JSPyBridge🌉. Bridge to interoperate Node.js and Python
Python MIT License UpdatedNov 19, 2023 -
-
pypdfium2-feedstock Public
Forked from AnacondaRecipes/pypdfium2-feedstockBSD 3-Clause "New" or "Revised" License UpdatedNov 11, 2023 -
pdfium-binaries-feedstock Public
Forked from AnacondaRecipes/pdfium-binaries-feedstockRepack of pdfium2 binaries for macOS, linux and Windows.
Shell BSD 3-Clause "New" or "Revised" License UpdatedOct 31, 2023 -
camelot Public
Forked from camelot-dev/camelotA Python library to extract tabular data from PDFs
Python MIT License UpdatedSep 25, 2023 -
backports.cached_property Public
Forked from penguinolog/backports.cached_propertyPython 3.8 functools.cached_property backport to python 3.6
Python MIT License UpdatedJun 7, 2023 -
scantailor-libs-build Public
Forked from ScanTailor-Advanced/scantailor-libs-buildBuilding scantailor and its dependencies
CMake UpdatedJun 7, 2023 -
lazyscorer-flask Public
Forked from sakthi1307/lazyscorer-flaskPython MIT License UpdatedApr 10, 2023 -
doctr Public
Forked from mindee/doctrdocTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
Python Apache License 2.0 UpdatedApr 2, 2023 -
-
-
deskew Public
Forked from sbrunner/deskewLibrary used to deskew a scanned document
Python MIT License UpdatedFeb 10, 2023