Skip to content
View lavinal712's full-sized avatar
🎾
🎾

Block or report lavinal712

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

1284 results for source starred repositories
Clear filter

We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length videos while achieving up to 6$\times$ acceleration in inference speed.

Python 81 2 Updated Dec 20, 2025

Native and Compact Structured Latents for 3D Generation

Python 1,819 117 Updated Dec 17, 2025

Official implementation for What matters for Representation Alignment: Global Information or Spatial Structure?

Python 124 6 Updated Dec 15, 2025

Towards Scalable Pre-training of Visual Tokenizers for Generation

Python 226 5 Updated Dec 16, 2025

[NeurIPS 2025] Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Python 444 17 Updated Dec 19, 2025

Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".

Python 96 Updated Dec 18, 2025

A framework for efficient model inference with omni-modality models

Python 1,020 141 Updated Dec 20, 2025

Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion

Python 297 3 Updated Dec 19, 2025

A collection of paper/projects that trains flow matching model/policies via RL.

326 10 Updated Dec 9, 2025

Visual Generation Tuning

Python 74 Updated Dec 1, 2025

Train VAE like a boss

Jupyter Notebook 309 13 Updated Oct 21, 2024

dLLM: Simple Diffusion Language Modeling

Python 1,472 149 Updated Dec 19, 2025

Code for "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates", Siggraph Asia 2024

Jupyter Notebook 1,251 128 Updated Jul 14, 2025

The official repo for "Vidi: Large Multimodal Models for Video Understanding and Editing"

Python 537 33 Updated Dec 11, 2025

Code release for Ming-UniVision: Joint Image Understanding and Geneation with a Continuous Unified Tokenizer

Python 133 5 Updated Oct 14, 2025

A series of math-specific large language models of our Qwen2 series.

Python 1,054 151 Updated Jan 11, 2025

Official repository for “DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation”

Python 134 6 Updated Dec 18, 2025

Easy and Efficient dLLM Fine-Tuning

Python 176 5 Updated Dec 15, 2025

The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the…

Python 2,290 215 Updated Dec 19, 2025
Python 7,488 442 Updated Dec 14, 2025

Official inference repo for FLUX.2 models

Python 1,243 62 Updated Dec 1, 2025

魔法少女的魔女裁判的文本框脚本,具体使用方式为按下enter自动生成图片并发送

Python 299 55 Updated Dec 20, 2025

Pytorch demo code and models for Multi-HMR

Python 377 30 Updated Nov 6, 2025

Margin-based Vision Transformer

59 2 Updated Nov 28, 2025

SAM 3D Objects

Python 4,994 461 Updated Dec 16, 2025

Depth Anything 3

Python 3,623 308 Updated Dec 12, 2025

HunyuanVideo-1.5: A leading lightweight video generation model

Python 2,010 97 Updated Dec 19, 2025

[ASPLOS'26] Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter

Python 114 8 Updated Dec 5, 2025
Next