CVPR2026 Pluggable Pruning with Contiguous Layer Distillation for Diffusion Transformers

Python 71 4 Updated Mar 12, 2026

A unified framework for easy reinforcement learning in Flow-Matching models

Python 275 17 Updated Mar 23, 2026

[ICLR 2026] "Does FLUX Already Know How to Perform Physically Plausible Image Composition?" (Official Implementation)

Python 132 2 Updated Mar 12, 2026

[CVPR2026 🎉] Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation.

Python 746 52 Updated Feb 21, 2026

FireRed-Image-Edit is a powerful image editing foundation model achieving open-source state-of-the-art performance with precise instruction following, high-fidelity generation, superior identity co…

Python 1,107 56 Updated Mar 10, 2026

Implementation of VLM4VLA

Python 143 6 Updated Feb 2, 2026

Streaming FLUX editor: edits every frame of a live camera feed at interactive FPS, based on FLUX.2-Klein-4B. Runs on a single H100 at 15+ FPS.

Python 61 12 Updated Feb 16, 2026

The ultimate training toolkit for finetuning diffusion models

Python 9,828 1,189 Updated Mar 23, 2026

Python 455 45 Updated Mar 12, 2026

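NanoBanana PPT Skills: a powerful tool that uses AI to automatically generate high-quality PPT images and videos, with support for smart transitions and interactive playback.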

Python 1,936 238 Updated Jan 19, 2026

Scalable group inference for generating high quality and diverse images with diffusion models.

Python 42 1 Updated Aug 31, 2025

[NeurIPS 2025 D&B🔥] ImgEdit: A Unified Image Editing Dataset and Benchmark

Python 294 6 Updated Nov 5, 2025

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 3,422 245 Updated Mar 6, 2026

[CVPR 2026] PersonaLive! : Expressive Portrait Image Animation for Live Streaming

Python 2,474 335 Updated Mar 5, 2026

[ICLR 2026] Taming large-scale few-step training with self-adversarial flows! 👏🏻

Python 496 26 Updated Feb 24, 2026

[Tutorial] Few-Step Distillation for Text-to-Image Generation: A Practical Guide

Python 350 22 Updated Dec 31, 2025

Visual Generation Tuning

Python 99 Updated Jan 27, 2026

[ICLR 2026] ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation

Python 679 41 Updated Nov 20, 2025

Official Implementation of "MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation"

Python 295 8 Updated Jan 29, 2026

[DEIMv2] Real Time Object Detection Meets DINOv3

Jupyter Notebook 1,594 172 Updated Mar 23, 2026

Study notes on writing quantitative investment strategies for the JoinQuant (聚宽) platform.

71 15 Updated Apr 25, 2018

[ICLR 2026 Oral] DiffusionNFT: Online Diffusion Reinforcement with Forward Process

Python 734 29 Updated Feb 10, 2026

OpenVision (ICCV 2025), OpenVision 2 (CVPR 2026), and OpenVision 3

Python 469 24 Updated Feb 21, 2026

Collection of extracted System Prompts from popular chatbots like ChatGPT, Claude & Gemini

HTML 34,889 5,618 Updated Mar 22, 2026

All-in-one training for vision models (YOLO, ViTs, RT-DETR, DINOv3): pretraining, fine-tuning, distillation.

Python 1,379 66 Updated Mar 23, 2026

TradingAgents: Multi-Agents LLM Financial Trading Framework

Python 38,918 7,271 Updated Mar 22, 2026

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 7,600 463 Updated Feb 10, 2026

T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation (ICCV'25)

Jupyter Notebook 46 2 Updated Oct 6, 2025

[ICCV 2025] Hybrid Layout Control for Diffusion Transformer: Fewer Annotations, Superior Aesthetics.

Jupyter Notebook 18 Updated Oct 23, 2025

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 24,565 2,146 Updated Sep 12, 2025