
Organizations

@pikepdf

SGLang is a fast serving framework for large language models and vision language models.

Python 21,882 3,826 Updated Dec 22, 2025

Toolkit for linearizing PDFs for LLM datasets/training

Python 16,260 1,257 Updated Dec 20, 2025

📄 Awesome OCR toolkits for multiple programming languages, based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.

Python 5,475 536 Updated Dec 19, 2025

QGIS Pip Manager plugin allows users to manage Python packages within their QGIS environment

Python 4 Updated Oct 21, 2025

Get your documents ready for gen AI

Python 47,422 3,331 Updated Dec 19, 2025

Platform to build admin panels, internal tools, and dashboards. Integrates with 25+ databases and any API.

TypeScript 38,711 4,406 Updated Dec 19, 2025

Proxmox VE Helper-Scripts (Community Edition)

Shell 24,120 2,174 Updated Dec 22, 2025

Tailscale Docker Proxy

Go 1,409 59 Updated Nov 20, 2025

GloVe and BERT language models re-trained using geoscientific text.

Jupyter Notebook 28 6 Updated Jan 24, 2024

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 18,996 1,300 Updated Oct 21, 2025

A curated list of resources around PDF files

147 14 Updated Aug 2, 2024

A curated list of resources for Document Understanding (DU) topic

1,486 166 Updated Jun 2, 2023

Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website …

HTML 13,460 1,111 Updated Dec 19, 2025

Run Windows apps such as Microsoft Office/Adobe in Linux (Ubuntu/Fedora) and GNOME/KDE as if they were a part of the native OS, including Nautilus integration.

Shell 10,127 448 Updated Aug 18, 2024

Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.

JavaScript 730 41 Updated Dec 6, 2025

🤖 Just a command runner

Rust 29,371 633 Updated Dec 12, 2025

Kolmogorov Arnold Networks

Jupyter Notebook 16,056 1,539 Updated Jan 19, 2025

Rust task runner and build tool.

Rust 2,874 142 Updated Dec 19, 2025

Your CLI home video recorder 📼

Go 18,014 343 Updated Dec 21, 2025

A more powerful alternative to sysctl(8) with a terminal user interface 🐧

Rust 1,419 26 Updated Dec 1, 2025

A simple Python macro script for LibreOffice that helps you generate content from selected words or sentences with OpenAI and Google AI

Python 66 9 Updated Sep 30, 2024

Python bindings for libimagequant

Python 24 4 Updated Dec 16, 2025

The definitive Web UI for local AI, with powerful features and easy setup.

Python 45,655 5,856 Updated Dec 21, 2025

LLM inference in C/C++

C++ 91,764 14,182 Updated Dec 21, 2025

PDF OCR Application, adds an OCR text layer to scanned PDF files, allowing them to be copied and searched.

Vue 61 2 Updated Nov 13, 2025

A Qiqqa Test Library / Test Corpus which contains various PDF document samples, etc. collected from live Qiqqa libraries to showcase issues and check regressions in the software.

HTML 9 1 Updated Sep 17, 2023

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Python 5,705 602 Updated Dec 14, 2025

A carefully-designed OCR pipeline for universal boarded table recognition and reconstruction.

C++ 178 44 Updated Jan 10, 2023

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,898 2,680 Updated Dec 15, 2025