Stars
A feature-rich command-line audio/video downloader
High-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle
The official repository for ERNIE 4.5 and ERNIEKit – its industrial-grade development toolkit based on PaddlePaddle.
番茄小说下载器 - 支持多平台的番茄小说下载工具,提供TXT/EPUB格式转换,GUI界面及GitHub Actions在线下载功能
[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL
Fast and memory-efficient exact attention
[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
Enjoy the magic of Diffusion models!
A simple HTML visualization tool for computer vision research 🛠️
Unlimited-length talking video generation that supports image-to-video and video-to-video generation
Official repository of In-Context LoRA for Diffusion Transformers
FastAPI framework, high performance, easy to learn, fast to code, ready for production
🐧 在 Linux 上提供一套完整的 Clash / Mihomo(Clash Meta) 代理与管理面板
High-fidelity performance metrics for generative models in PyTorch
🚀 The fast, Pythonic way to build MCP servers and clients
RetDec is a retargetable machine-code decompiler based on LLVM.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
SGLang is a high-performance serving framework for large language models and multimodal models.
Quick scripts to calculate CLIP text-image similarity
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.