Stars
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
We write your reusable computer vision tools. 💜
freeCodeCamp.org's open-source codebase and curriculum. Learn math, programming, and computer science for free.
OpenOCR: An Open-Source Toolkit for General-OCR Research and Applications, integrates a unified training and evaluation benchmark, commercial-grade OCR and Document Parsing systems, and faithful re…
Efficiently converts LabelMe's JSON format to the YOLOv5 dataset format.
PPOCRLabelv3 is a semi-automatic graphic annotation tool suitable for OCR field, with built-in PP-OCR model to automatically detect and re-recognize data.
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
A latent text-to-image diffusion model
Ultralytics YOLOv5 in PyTorch > ONNX > CoreML > TFLite
Framework agnostic sliced/tiled inference + interactive ui + error analysis plots
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
A fast reverse proxy to help you expose a local server behind a NAT or firewall to the internet.
A toolkit for making real world machine learning and data analysis applications in C++
The world's simplest facial recognition api for Python and the command line
Minimal and Clean Reinforcement Learning Examples
Source Han Serif | 思源宋体 | 思源宋體 | 思源宋體 香港 | 源ノ明朝 | 본명조
A tiny, fast Rust CLI that drives a real browser over the Chrome DevTools Protocol — built for coding agents.