Stars
Stable Diffusion web UI
The most powerful and modular diffusion model GUI, API and backend with a graph/nodes interface.
FastAPI framework, high performance, easy to learn, fast to code, ready for production
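Dive into Deep Learning (《动手学深度学习》): written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.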
A high-throughput and memory-efficient inference and serving engine for LLMs
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
Get your documents ready for gen AI
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
Official inference repo for FLUX.1 models
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone
Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
Qwen3-Coder is the code version of Qwen3, the large language model series developed by the Qwen team.
A collection of projects designed to help developers quickly get started with building deployable applications using the Claude API