A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 3,879 280 Updated Sep 25, 2025

dvlab-research / VisionThink

[NeurIPS 2025] Efficient Reasoning Vision Language Models

Python 439 29 Updated Sep 18, 2025

XinzeZhang / HUST-PhD-Thesis-Latex

LaTeX template for doctoral dissertations at Huazhong University of Science and Technology (HUST)

TeX 229 48 Updated Jul 24, 2025

Gen-Verse / MMaDA

MMaDA - Open-Sourced Multimodal Large Diffusion Language Models

Python 1,533 78 Updated Nov 16, 2025

FoundationVision / Liquid

(Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators

Python 635 34 Updated Nov 10, 2025

lxa9867 / Awesome-Autoregressive-Visual-Generation

This is a repo to track the latest autoregressive visual generation papers.

420 5 Updated Jun 25, 2025

youngsheen / SimVQ

[ICCV 2025] SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer

Python 311 8 Updated Dec 29, 2024

wjf5203 / TokBench

Image and video Tokenizer/VAE selection guide, text and face reconstruction evaluation.

Python 133 Updated Nov 24, 2025

zhaoyue-zephyrus / bsq-vit

[ICLR 2025][arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization

Python 188 6 Updated Dec 18, 2025

willisma / SiT

Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"

Python 1,052 64 Updated Nov 4, 2025

stepfun-ai / Step-Video-T2V

Python 3,140 328 Updated Mar 17, 2025

Junfeng Wu wjf5203

Stars

zh460045050 / VQGAN-LC

MiroMindAI / MiroThinker

apple / ml-atoken

UCSC-VLAA / OpenVision

EvolvingLMMs-Lab / LLaVA-OneVision-1.5

zhuangshaobin / WeTok

EvolvingLMMs-Lab / lmms-eval

kylesargent / FlowMo

LLaVA-VL / LLaVA-NeXT

google-research / big_vision

lmmlzn / Awesome-LLMs-Datasets

mlabonne / llm-datasets

Zjh-819 / LLMDataHub

RUCAIBox / awesome-llm-pretraining

ali-vilab / alitok

apple / ml-flextok

wyhlovecpp / GPT-Image-Edit

X-Omni-Team / X-Omni

Hhhhhhao / continuous_tokenizer

facebookresearch / flow_matching