Skip to content
View limuloo's full-sized avatar
🐒
Focusing
🐒
Focusing

Block or report limuloo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Unified Video Editing with Temporal Reasoner

Python 90 5 Updated Dec 17, 2025

ContextGen: Contextual Layout Anchoring for Identity-Consistent Multi-Instance Generation

Python 43 Updated Dec 16, 2025

[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

Python 655 25 Updated Oct 25, 2024

[NIPS 2025] Seg2Any: Open-set Segmentation-Mask-to-Image Generation with Precise Shape and Semantic Control

Python 34 3 Updated Dec 2, 2025

Official implementation of DepthLM

Python 275 12 Updated Oct 7, 2025
Python 7,574 445 Updated Dec 14, 2025
HTML 5 Updated Nov 25, 2025

(NeurIPS 2025 D&B Track) OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps

Python 23 2 Updated Nov 12, 2025

Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potential in Unified Multimodal Models through Self-supervised Learning.

Python 335 11 Updated Dec 22, 2025

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 6,447 362 Updated Dec 19, 2025

Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"

Python 422 22 Updated Jun 20, 2025

High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning

52 Updated Jul 23, 2025

Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"

Jupyter Notebook 297 11 Updated Sep 28, 2025

[ACMMM 2025] "Set You Straight: Auto-Steering Denoising Trajectories to Sidestep Unwanted Concepts" (Official Implementation)

Python 77 1 Updated Jul 5, 2025

A Collection of Papers and Codes for CVPR2025/ICCV2025/CVPR2024/ECCV2024 AIGC

616 17 Updated Oct 30, 2025

Calligrapher: Freestyle Text Image Customization

Python 295 22 Updated Sep 3, 2025

Unified layout planning and image generation, ICCV2025

Python 39 1 Updated Apr 14, 2025

This repository open-sources CreatiPoster, an AI-driven graphic design generation system for multi-layer and editable compositions with strong visual appeal.

73 2 Updated Jun 14, 2025
Python 32 2 Updated Jun 13, 2025

ControlThinker: Unveiling Latent Semantics for Controllable Image Generation through Visual Reasoning

Python 8 1 Updated Aug 11, 2025

Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer"(ICCV2025)

Python 1,705 127 Updated Jul 25, 2025

A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design

Python 135 11 Updated Jun 5, 2025

[NeurIPS 2025] IEAP: Image Editing As Programs with Diffusion Models

Python 109 5 Updated Sep 27, 2025

Layout Conditioned Image Generation, NeurIPS2024

Python 64 3 Updated Sep 3, 2025

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Python 821 25 Updated Nov 25, 2025

Official inference repo for FLUX.1 models

Python 24,937 1,829 Updated Jul 31, 2025

[CVPRW 2022] MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment

Python 394 44 Updated Jun 10, 2023

Code for "Diffusion Model Alignment Using Direct Preference Optimization"

Python 630 45 Updated Nov 10, 2025

Train transformer language models with reinforcement learning.

Python 16,728 2,371 Updated Dec 22, 2025

Open-source unified multimodal model

Python 5,493 480 Updated Oct 27, 2025
Next