-
05:24
(UTC +08:00)
Lists (3)
Sort Name ascending (A-Z)
Starred repositories
📚 Freely available programming books
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
A high-throughput and memory-efficient inference and serving engine for LLMs
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Free and Open Source Enterprise Resource Planning (ERP)
Convert PDF to markdown + JSON quickly with high accuracy
Official inference framework for 1-bit LLMs
GUI for a Vocal Remover that uses Deep Neural Networks.
Toolkit for linearizing PDFs for LLM datasets/training
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
Free and Open Source Machine Translation API. Self-hosted, offline capable and easy to setup.
A Comprehensive Toolkit for High-Quality PDF Content Extraction
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
A debugging and profiling tool that can trace and visualize python code execution
[AAAI 2025] Official implementation of "OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on"
Multilingual Document Layout Parsing in a Single Vision-Language Model
📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.
AI-powered reverse engineering assistant that bridges IDA Pro with language models through MCP.
[python3.6] 运用tf实现自然场景文字检测,keras/pytorch实现ctpn+crnn+ctc实现不定长场景文字OCR识别