Highlights
- Pro
Stars
Codebase for Distillation Robustifies Unlearning
LLM-friendly scraper for media and text from social media and the open web.
[CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis
The simplest, fastest repository for training/finetuning small-sized VLMs.
F Lite is a 10B parameter diffusion model created by Freepik and Fal, trained exclusively on copyright-safe and SFW content.
Lightweight Python kit for easy multimodal data processing
[⭐ CVPR 2025 Highlight ⭐] Official Implementation of the paper STEREO: A Two-Stage Framework for Adversarially Robust Concept Erasing from Text-to-Image Diffusion Models
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
This is the official implementation of "Vec2Face: Scaling Face Dataset Generation with Loosely Constrained Vectors", which is accepted at ICLR2025.
[ECCV 2024] "Receler: Reliable Concept Erasing of Text-to-Image Diffusion Models via Lightweight Erasers" (Official Implementation)
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
The official implementation of the paper "Towards Safe Self-Distillation of Internet-Scale Text-to-Image Diffusion Models" (ICML 2023 Workshop on Challenges in Deployable Generative AI)
A smaller subset of 10 easily classified classes from Imagenet, and a little more French
Open-Sora: Democratizing Efficient Video Production for All
Minimalist Hugo template for academic websites
The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai
Avalanche: an End-to-End Library for Continual Learning based on PyTorch.
VQA counting task with higher object numbers
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
✨✨Latest Advances on Multimodal Large Language Models
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation