Stars
Fast and memory-efficient exact attention
[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
Enjoy the magic of Diffusion models!
A simple HTML visualization tool for computer vision research 🛠️
Unlimited-length talking video generation that supports image-to-video and video-to-video generation
Official repository of In-Context LoRA for Diffusion Transformers
FastAPI framework, high performance, easy to learn, fast to code, ready for production
High-fidelity performance metrics for generative models in PyTorch
🚀 The fast, Pythonic way to build MCP servers and clients
RetDec is a retargetable machine-code decompiler based on LLVM.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
SGLang is a fast serving framework for large language models and vision language models.
Quick scripts to calculate CLIP text-image similarity
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version in translation
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Official implementation of the AAAI2024 paper: Open-Set Facial Expression Recognition
State-of-the-art 2D and 3D Face Analysis Project