Skip to content
View lavinal712's full-sized avatar
🎾
🎾

Block or report lavinal712

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length videos while achieving up to 6$\times$ acceleration in inference speed.

Python 52 1 Updated Dec 19, 2025

Native and Compact Structured Latents for 3D Generation

Python 1,711 107 Updated Dec 17, 2025

Official implementation for What matters for Representation Alignment: Global Information or Spatial Structure?

Python 117 6 Updated Dec 15, 2025

Towards Scalable Pre-training of Visual Tokenizers for Generation

Python 208 5 Updated Dec 16, 2025

[NeurIPS 2025] Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Python 439 16 Updated Dec 19, 2025

Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".

Python 94 Updated Dec 18, 2025

A framework for efficient model inference with omni-modality models

Python 1,003 136 Updated Dec 19, 2025

Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion

Python 297 3 Updated Dec 19, 2025

A collection of paper/projects that trains flow matching model/policies via RL.

326 10 Updated Dec 9, 2025

Visual Generation Tuning

Python 73 Updated Dec 1, 2025

Train VAE like a boss

Jupyter Notebook 309 13 Updated Oct 21, 2024

dLLM: Simple Diffusion Language Modeling

Python 1,447 145 Updated Dec 19, 2025

Code for "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates", Siggraph Asia 2024

Jupyter Notebook 1,248 128 Updated Jul 14, 2025

The official repo for "Vidi: Large Multimodal Models for Video Understanding and Editing"

Python 535 33 Updated Dec 11, 2025

Code release for Ming-UniVision: Joint Image Understanding and Geneation with a Continuous Unified Tokenizer

Python 133 5 Updated Oct 14, 2025

Project Sekai sticker maker

JavaScript 503 117 Updated Jan 14, 2024

A series of math-specific large language models of our Qwen2 series.

Python 1,054 151 Updated Jan 11, 2025

Official repository for “DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation”

Python 134 6 Updated Dec 18, 2025

Easy and Efficient dLLM Fine-Tuning

Python 175 5 Updated Dec 15, 2025

The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the…

Python 2,265 214 Updated Dec 18, 2025
Python 7,420 434 Updated Dec 14, 2025

Official inference repo for FLUX.2 models

Python 1,240 62 Updated Dec 1, 2025

魔法少女的魔女裁判的文本框脚本,具体使用方式为按下enter自动生成图片并发送

Python 298 55 Updated Dec 17, 2025

Pytorch demo code and models for Multi-HMR

Python 377 30 Updated Nov 6, 2025

Margin-based Vision Transformer

59 2 Updated Nov 28, 2025

SAM 3D Objects

Python 4,971 456 Updated Dec 16, 2025

Depth Anything 3

Python 3,607 308 Updated Dec 12, 2025

HunyuanVideo-1.5: A leading lightweight video generation model

Python 1,997 97 Updated Dec 19, 2025
Next