Stars
💫 Toolkit to help you get started with Spec-Driven Development
[CVPR 2025 Highlight] InPO: Inversion Preference Optimization with Reparametrized DDIM for Efficient Diffusion Model Alignment
[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"
[CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization
Official implementation for "Story2Board: A Training‑Free Approach for Expressive Storyboard Generation"
[NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
[CVPR 2024 Highlight] MIGC and [TPAMI 2024] MIGC++ (Official Implementation)
Official implementation of Progressive Detail Injection for Training-Free Semantic Binding in Text-to-Image Generation
Official Implementation for "Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models" (SIGGRAPH 2023)
Weekly Go Online Meetup via Bilibili|Go 夜读|通过 bilibili 在线直播的方式分享 Go 相关的技术话题,每天大家在微信/telegram/Slack 上及时沟通交流编程技术话题。
talkgo / read
Forked from yangwenmai/learning-golangGo 学习之路:Go 开发者博客、Go 微信公众号、Go 学习资料(文档、书籍、视频)
[ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG)
Official implementation of ICCV 2025 paper - CharaConsist: Fine-Grained Consistent Character Generation
DyMO: Training-Free Diffusion Model Alignment with Dynamic Multi-Objective Scheduling
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
A fast reverse proxy to help you expose a local server behind a NAT or firewall to the internet.
[ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning
[NeurIPS 2025] T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT
✨✨Latest Advances on Multimodal Large Language Models
This repository collects an extensive list of awesome papers about Story Generation / Storytelling, exclusively focusing on the era of Large Language Models (LLMs).
SNIFFER: Multimodal Large Language Model for Explainable Out-of-Context Misinformation Detection