wangyongliang

Follow

Yongliang Wang wangyongliang

Follow

~

33 followers · 171 following

Alibaba
Beijing, China

Achievements

Achievements

Lists (3)

Sort

🔮 Future ideas

✨ Inspiration

🚀 My stack

Starred repositories

sihyun-yu / REPA

[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 1,461 69 Updated Mar 16, 2025

dask / dask

Parallel computing with task scheduling

Python 13,657 1,826 Updated Dec 17, 2025

black-forest-labs / flux2

Official inference repo for FLUX.2 models

Python 1,233 63 Updated Dec 1, 2025

Yutong-Zhou-cv / Awesome-Text-to-Image

(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.

2,414 206 Updated Nov 12, 2025

tianweiy / CausVid

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 1,105 63 Updated Aug 7, 2025

yifan123 / flow_grpo

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 1,760 103 Updated Nov 4, 2025

yejy53 / RealGen

RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards.

Python 282 18 Updated Dec 17, 2025

ByteDance-Seed / Bagel

Open-source unified multimodal model

Python 5,475 480 Updated Oct 27, 2025

tencent-ailab / IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Jupyter Notebook 6,364 407 Updated Jun 28, 2024

PKU-YuanGroup / ConsisID

[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition

Python 791 44 Updated Aug 30, 2025

hkchengrex / MMAudio

[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Python 2,016 237 Updated Nov 30, 2025

showlab / ShowUI

[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

Python 1,587 117 Updated May 29, 2025

FoundationVision / VAR

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,559 551 Updated Nov 10, 2025

instantX-research / InstantID

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 11,882 874 Updated Jul 18, 2024

Tongyi-MAI / Z-Image

Python 7,311 424 Updated Dec 14, 2025

bytedance / deer-flow

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Python 18,719 2,348 Updated Dec 17, 2025

AutoViML / AutoViz

Automatically Visualize any dataset, any size with a single line of code. Created by Ram Seshadri. Collaborators Welcome. Permission Granted upon Request.

Python 1,866 211 Updated Jun 10, 2024

invoke-ai / InvokeAI

Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The …

TypeScript 26,424 2,744 Updated Dec 16, 2025

wyhlovecpp / GPT-Image-Edit

GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset

Python 238 5 Updated Aug 15, 2025

0xCrunchyy / hypergen

HyperGen - Optimized inference and fine-tuning framework for diffusion (image & video) models. Up to 3x faster & 80% less VRAM.

Python 1,309 114 Updated Oct 21, 2025

mem0ai / mem0

Universal memory layer for AI Agents

Python 44,420 4,825 Updated Dec 17, 2025

browser-use / browser-use

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 73,885 8,835 Updated Dec 18, 2025

HKUSTDial / awesome-data-agents

Continuously updated paper list on advancements in Data Agents. Companion repo to our paper "A Survey of Data Agents: Emerging Paradigm or Overstated Hype?"

Python 322 16 Updated Dec 16, 2025

MemoriLabs / Memori

SQL Native Memory Layer for LLMs, AI Agents & Multi-Agent Systems

Python 11,110 724 Updated Dec 17, 2025

HqWu-HITCS / Awesome-Personalized-LLM

This repo aims to record resource of role-playing abilities in LLMs, including dataset, paper, application, etc.

133 6 Updated Sep 23, 2024

pydn / ComfyUI-to-Python-Extension

A powerful tool that translates ComfyUI workflows into executable Python code.

Python 2,168 191 Updated Sep 26, 2025

Distillery-Dev / Discomfort

Discomfort: Control ComfyUI with Python

Python 263 31 Updated Oct 17, 2025

wendashi / Cool-GenAI-Fashion-Papers

[ACM TIST 2025] GenAI in Fashion: Overview, also includes 🔥latest papers, ⚙️metrics, 👀workshops, 🚀companies & products, ...)

202 13 Updated Dec 10, 2025

ali-vilab / VACE

[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing

Python 3,503 244 Updated Oct 17, 2025

VectorSpaceLab / EditScore

EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling

Python 180 4 Updated Nov 21, 2025

Starred topics

llms

large-language-models

reinforcement-learning

ai-agents

genai

Artificial Intelligence

artificial-inteligence

agents

Large Language Model

Deep learning

See all starred topics