  • Alibaba
  • Beijing, China

Starred repositories

[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 1,458 68 Updated Mar 16, 2025

Parallel computing with task scheduling

Python 13,655 1,827 Updated Dec 17, 2025

Official inference repo for FLUX.2 models

Python 1,227 63 Updated Dec 1, 2025

(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.

2,414 206 Updated Nov 12, 2025

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 1,100 63 Updated Aug 7, 2025

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 1,757 103 Updated Nov 4, 2025

RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards.

Python 279 18 Updated Dec 17, 2025

Open-source unified multimodal model

Python 5,467 478 Updated Oct 27, 2025

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images from an image prompt.

Jupyter Notebook 6,362 406 Updated Jun 28, 2024

[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition

Python 791 44 Updated Aug 30, 2025

[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Python 2,015 236 Updated Nov 30, 2025

[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

Python 1,587 117 Updated May 29, 2025

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,559 551 Updated Nov 10, 2025

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 11,883 874 Updated Jul 18, 2024

Python 7,252 420 Updated Dec 14, 2025

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Python 18,706 2,347 Updated Dec 17, 2025

Automatically Visualize any dataset, any size with a single line of code. Created by Ram Seshadri. Collaborators Welcome. Permission Granted upon Request.

Python 1,866 211 Updated Jun 10, 2024

Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The …

TypeScript 26,422 2,743 Updated Dec 16, 2025

GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset

Python 238 5 Updated Aug 15, 2025

HyperGen - Optimized inference and fine-tuning framework for diffusion (image & video) models. Up to 3x faster & 80% less VRAM.

Python 1,309 114 Updated Oct 21, 2025

Universal memory layer for AI Agents

Python 44,382 4,825 Updated Dec 17, 2025

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 73,854 8,835 Updated Dec 17, 2025

Continuously updated paper list on advancements in Data Agents. Companion repo to our paper "A Survey of Data Agents: Emerging Paradigm or Overstated Hype?"

Python 320 16 Updated Dec 16, 2025

SQL Native Memory Layer for LLMs, AI Agents & Multi-Agent Systems

Python 11,073 724 Updated Dec 17, 2025

This repo collects resources on role-playing abilities in LLMs, including datasets, papers, applications, etc.

133 6 Updated Sep 23, 2024

A powerful tool that translates ComfyUI workflows into executable Python code.

Python 2,168 190 Updated Sep 26, 2025

Discomfort: Control ComfyUI with Python

Python 263 31 Updated Oct 17, 2025

[ACM TIST 2025] GenAI in Fashion: Overview, also includes 🔥latest papers, ⚙️metrics, 👀workshops, 🚀companies & products, ...

202 13 Updated Dec 10, 2025

[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing

Python 3,501 244 Updated Oct 17, 2025

EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling

Python 180 4 Updated Nov 21, 2025