Yuqian Hong lavinal712

🎾

Master's degree candidate at USTC

43 followers · 201 following

Starred repositories

1284 results for source starred repositories

Francis-Rings / FlashPortrait

We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length videos while achieving up to 6× acceleration in inference speed.

Python 81 2 Updated Dec 20, 2025

microsoft / TRELLIS.2

Native and Compact Structured Latents for 3D Generation

Python 1,819 117 Updated Dec 17, 2025

End2End-Diffusion / iREPA

Official implementation for What matters for Representation Alignment: Global Information or Spatial Structure?

Python 124 6 Updated Dec 15, 2025

MiniMax-AI / VTP

Towards Scalable Pre-training of Visual Tokenizers for Generation

Python 226 5 Updated Dec 16, 2025

ali-vilab / Wan-Move

[NeurIPS 2025] Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Python 444 17 Updated Dec 19, 2025

KlingTeam / SVG-T2I

Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".

Python 96 Updated Dec 18, 2025

vllm-project / vllm-omni

A framework for efficient model inference with omni-modality models

Python 1,020 141 Updated Dec 20, 2025

yuemingPAN / SFD

Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion

Python 297 3 Updated Dec 19, 2025

Tonghe-Zhang / Awesome-Flow-RL-Papers

A collection of papers/projects that train flow matching models/policies via RL.

326 10 Updated Dec 9, 2025

hustvl / VGT

Visual Generation Tuning

Python 74 Updated Dec 1, 2025

cloneofsimo / vqgan-training

Train VAE like a boss

Jupyter Notebook 309 13 Updated Oct 21, 2024

ZHZisZZ / dllm

dLLM: Simple Diffusion Language Modeling

Python 1,472 149 Updated Dec 19, 2025

zju3dv / GVHMR

Code for "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates", SIGGRAPH Asia 2024

Jupyter Notebook 1,251 128 Updated Jul 14, 2025

bytedance / vidi

The official repo for "Vidi: Large Multimodal Models for Video Understanding and Editing"

Python 537 33 Updated Dec 11, 2025

inclusionAI / Ming-UniVision

Code release for Ming-UniVision: Joint Image Understanding and Generation with a Continuous Unified Tokenizer

Python 133 5 Updated Oct 14, 2025

QwenLM / Qwen2.5-Math

A series of math-specific large language models of our Qwen2 series.

Python 1,054 151 Updated Jan 11, 2025

MarkCup-Official / Anan-s-Sketchbook-Chat-Box

Python 168 28 Updated Nov 14, 2025

Zehong-Ma / DeCo

Official repository for “DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation”

Python 134 6 Updated Dec 18, 2025

deepseek-ai / DeepSeek-Math-V2

Python 1,491 118 Updated Dec 1, 2025

inclusionAI / dFactory

Easy and Efficient dLLM Fine-Tuning

Python 176 5 Updated Dec 15, 2025

facebookresearch / sam-3d-body

The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the…

Python 2,290 215 Updated Dec 19, 2025