Stars
[CVPR 2025] Official implementation of "AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models"
End2End Virtual Try-on with Visual Reference, CVPR2026
A repository for organizing papers, code, and other resources related to Virtual Try-on Models
Learning on the Job: An Experience-Driven, Self-Evolving Agent for Long-Horizon Tasks
[NeurIPS 2025] PyTorch implementation of ThinkSound, a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.
Helios: Real Real-Time Long Video Generation Model
Open source alternative to NotebookLM for teams. Join our Discord: https://discord.gg/ejRNvftDp9
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats including EPUB books and PDF documents.
PaperBanana: Automating Academic Illustration For AI Scientists
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
Translate the video from one language to another and embed dubbing & subtitles.
Tongyi Deep Research, the Leading Open-source Deep Research Agent
An autonomous agent that conducts deep research on any data using any LLM provider
Open Multi-Agent Interactive Classroom — Get an immersive, multi-agent learning experience in just one click
Vietnamese TTS with instant voice cloning • On-device • Real-time CPU inference • 24kHz audio quality • Converts text to Vietnamese speech
Enjoy the magic of Diffusion models!
A general fine-tuning kit geared toward image/video/audio diffusion models.
Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference
NeRA adapter introduced in the paper "Dressing the Imagination: A Dataset for AI-Powered Translation of Text into Fashion Outfits and A Novel NeRA Adapter for Enhanced Feature Adaptation" (WACV 2026)
[AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high …
OpenClaw: Use all major AI models with no API token! Claude/ChatGPT/Gemini/DeepSeek/Doubao/Grok/Qwen/Manus/Kimi
[AAAI 2025] Official implementation of "OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on"