Stars
An extremely fast Python package and project manager, written in Rust.
Benchmarking Knowledge Transfer in Lifelong Robot Learning
Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (LLM).
H-RDT: Human Manipulation Enhanced Bimanual Robotic Manipulation
[ICLR’26] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control
[ECCV 2024] "Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers" (Official Implementation)
MC$^2$: Multi-concept Guidance for Customized Multi-concept Generation
Control SD with Llama2 for concept regularization
Official implementation of NeurIPS'24 paper "Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Models". This work adversarially unlearns the text encoder to enh…
VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models (CVPR 2024)
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Official code of SmartEdit [CVPR-2024 Highlight]
A collection of awesome text-to-image generation studies.
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…
[CVPR 2024] Official implementation of "DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations"
[CVPR 2024] "MACE: Mass Concept Erasure in Diffusion Models" (Official Implementation)
Stable Diffusion web UI
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
[CVPR 2024] Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution
Official implementation of paper "One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications".
The official implementation of ECCV'24 paper "To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Unsafe Images ... For Now". This work introduces one fast and e…
[AAAI 2024] Official Implementation of Language-Guided Transformer for Federated Multi-Label Classification
[ICML 2024] Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts (Official Pytorch Implementation)
Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts
[CVPR2024, Highlight] Official code for DragDiffusion
[AAAI2023] Token Mixing: Parameter-Efficient Transfer Learning from Image-Language to Video-Language