wjf5203

Junfeng Wu wjf5203

PhD student, Huazhong University of Science and Technology, Computer Vision

114 followers · 2 following

HUST | Research intern at ByteDance
Wuhan, China
https://wjf5203.github.io/

Achievements

Stars

221 results for source starred repositories

Clear filter

zh460045050 / VQGAN-LC

Python 141 9 Updated Jun 28, 2024

MiroMindAI / MiroThinker

MiroThinker is an open source deep research agent optimized for research and prediction. It achieves a 80.8% Avg@8 score on the challenging GAIA benchmark.

Python 6,096 450 Updated Feb 4, 2026

apple / ml-atoken

Jupyter Notebook 115 3 Updated Nov 8, 2025

UCSC-VLAA / OpenVision

[ICCV 2025] OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning

Python 447 24 Updated Jan 29, 2026

EvolvingLMMs-Lab / LLaVA-OneVision-1.5

Fully Open Framework for Democratized Multimodal Training

Python 716 57 Updated Dec 27, 2025

zhuangshaobin / WeTok

WeTok: Powerful Discrete Tokenization for High-Fidelity Visual Reconstruction

Python 57 2 Updated Sep 3, 2025

EvolvingLMMs-Lab / lmms-eval

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,623 512 Updated Feb 4, 2026

kylesargent / FlowMo

Official PyTorch implementation of FlowMo.

Jupyter Notebook 110 7 Updated Apr 7, 2025

LLaVA-VL / LLaVA-NeXT

Python 4,549 441 Updated Sep 14, 2025

google-research / big_vision

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 3,344 207 Updated May 19, 2025

lmmlzn / Awesome-LLMs-Datasets

Summarize existing representative LLMs text datasets.

1,429 140 Updated Oct 11, 2025

mlabonne / llm-datasets

Curated list of datasets and tools for post-training.

4,223 351 Updated Nov 10, 2025

Zjh-819 / LLMDataHub

A quick guide (especially) for trending instruction finetuning datasets

3,356 230 Updated Nov 28, 2023

RUCAIBox / awesome-llm-pretraining

Awesome LLM pre-training resources, including data, frameworks, and methods.

323 23 Updated Apr 29, 2025

ali-vilab / alitok

[ICLR2026] AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model

Python 53 2 Updated Oct 12, 2025

apple / ml-flextok

FlexTok: Resampling Images into 1D Token Sequences of Flexible Length

Jupyter Notebook 290 14 Updated Jun 2, 2025

wyhlovecpp / GPT-Image-Edit

GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset

Python 244 5 Updated Aug 15, 2025

X-Omni-Team / X-Omni

Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).

Python 420 11 Updated Aug 26, 2025

Hhhhhhao / continuous_tokenizer

Python 304 7 Updated May 29, 2025

facebookresearch / flow_matching

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 4,092 300 Updated Jan 5, 2026

JIA-Lab-research / VisionThink

[NeurIPS 2025] Efficient Reasoning Vision Language Models

Python 448 29 Updated Sep 18, 2025

XinzeZhang / HUST-PhD-Thesis-Latex

华中科技大学博士毕业论文Latex模板

TeX 245 50 Updated Jul 24, 2025

Gen-Verse / MMaDA

MMaDA - Open-Sourced Multimodal Large Diffusion Language Models

Python 1,570 83 Updated Nov 16, 2025

FoundationVision / Liquid

(Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators

Python 637 34 Updated Nov 10, 2025

lxa9867 / Awesome-Autoregressive-Visual-Generation

This is a repo to track the latest autoregressive visual generation papers.

430 5 Updated Jun 25, 2025

youngsheen / SimVQ

[ICCV 2025] SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer

Python 316 9 Updated Dec 29, 2024

wjf5203 / TokBench

Image and video Tokenizer/VAE selection guide, text and face reconstruction evaluation.

Python 135 Updated Nov 24, 2025

zhaoyue-zephyrus / bsq-vit

[ICLR 2025][arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization

Python 197 6 Updated Dec 18, 2025

willisma / SiT

Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"

Python 1,092 68 Updated Dec 22, 2025

stepfun-ai / Step-Video-T2V

Python 3,167 335 Updated Mar 17, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Junfeng Wu wjf5203

Achievements

Achievements

Block or report wjf5203

Stars

zh460045050 / VQGAN-LC

MiroMindAI / MiroThinker

apple / ml-atoken

UCSC-VLAA / OpenVision

EvolvingLMMs-Lab / LLaVA-OneVision-1.5

zhuangshaobin / WeTok

EvolvingLMMs-Lab / lmms-eval

kylesargent / FlowMo

LLaVA-VL / LLaVA-NeXT

google-research / big_vision

lmmlzn / Awesome-LLMs-Datasets

mlabonne / llm-datasets

Zjh-819 / LLMDataHub

RUCAIBox / awesome-llm-pretraining

ali-vilab / alitok

apple / ml-flextok

wyhlovecpp / GPT-Image-Edit

X-Omni-Team / X-Omni

Hhhhhhao / continuous_tokenizer

facebookresearch / flow_matching

JIA-Lab-research / VisionThink

XinzeZhang / HUST-PhD-Thesis-Latex

Gen-Verse / MMaDA

FoundationVision / Liquid

lxa9867 / Awesome-Autoregressive-Visual-Generation

youngsheen / SimVQ

wjf5203 / TokBench

zhaoyue-zephyrus / bsq-vit

willisma / SiT

stepfun-ai / Step-Video-T2V