Stars
Autonomous experiment loop extension for pi
DFlash: Block Diffusion for Flash Speculative Decoding
Open Source Robotic Arm for All Developers
Implementation of RL-100, Performant Robotic Manipulation with Real-World Reinforcement Learning
Witness the aha moment of VLM with less than $3.
Minimal reproduction of DeepSeek R1-Zero
The official implementation of CVPR'25 Oral paper "Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise"
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
🤗 smolagents: a barebones library for agents that think in code.
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
AI Agent Framework, the Pydantic way
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
the simplest self-building general autonomous agent
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Entropy Based Sampling and Parallel CoT Decoding
Text-to-Music Generation with Rectified Flow Transformers
Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control"
The ultimate training toolkit for finetuning diffusion models
Open source Claude Artifacts – built with Llama 3.1 405B
[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
A curated list of recent diffusion models for video generation, editing, and various other applications.
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Official repo for consistency models.
Finetune ModelScope's Text To Video model using Diffusers 🧨
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"