wangyongliang
  • Alibaba
  • Beijing, China

Starred repositories

[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 1,461 69 Updated Mar 16, 2025

Parallel computing with task scheduling

Python 13,657 1,826 Updated Dec 17, 2025

Official inference repo for FLUX.2 models

Python 1,233 63 Updated Dec 1, 2025

(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.

2,414 206 Updated Nov 12, 2025

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 1,105 63 Updated Aug 7, 2025

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 1,760 103 Updated Nov 4, 2025

RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards.

Python 282 18 Updated Dec 17, 2025

Open-source unified multimodal model

Python 5,475 480 Updated Oct 27, 2025

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.

Jupyter Notebook 6,364 407 Updated Jun 28, 2024

[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition

Python 791 44 Updated Aug 30, 2025

[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Python 2,016 237 Updated Nov 30, 2025

[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

Python 1,587 117 Updated May 29, 2025

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,559 551 Updated Nov 10, 2025

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 11,882 874 Updated Jul 18, 2024

Python 7,311 424 Updated Dec 14, 2025

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Python 18,719 2,348 Updated Dec 17, 2025

Automatically Visualize any dataset, any size with a single line of code. Created by Ram Seshadri. Collaborators Welcome. Permission Granted upon Request.

Python 1,866 211 Updated Jun 10, 2024

Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The …

TypeScript 26,424 2,744 Updated Dec 16, 2025

GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset

Python 238 5 Updated Aug 15, 2025

HyperGen - Optimized inference and fine-tuning framework for diffusion (image & video) models. Up to 3x faster & 80% less VRAM.

Python 1,309 114 Updated Oct 21, 2025

Universal memory layer for AI Agents

Python 44,420 4,825 Updated Dec 17, 2025

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 73,885 8,835 Updated Dec 18, 2025

Continuously updated paper list on advancements in Data Agents. Companion repo to our paper "A Survey of Data Agents: Emerging Paradigm or Overstated Hype?"

Python 322 16 Updated Dec 16, 2025

SQL Native Memory Layer for LLMs, AI Agents & Multi-Agent Systems

Python 11,110 724 Updated Dec 17, 2025

This repo aims to record resources on role-playing abilities in LLMs, including datasets, papers, applications, etc.

133 6 Updated Sep 23, 2024

A powerful tool that translates ComfyUI workflows into executable Python code.

Python 2,168 191 Updated Sep 26, 2025

Discomfort: Control ComfyUI with Python

Python 263 31 Updated Oct 17, 2025

[ACM TIST 2025] GenAI in Fashion: Overview (also includes 🔥latest papers, ⚙️metrics, 👀workshops, 🚀companies & products, ...)

202 13 Updated Dec 10, 2025

[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing

Python 3,503 244 Updated Oct 17, 2025

EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling

Python 180 4 Updated Nov 21, 2025