
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Python 1,192 178 Updated Jan 20, 2026

OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing

Python 36 Updated Jan 9, 2026

NextFlow🚀: Unified Sequential Modeling Activates Multimodal Understanding and Generation

306 15 Updated Jan 9, 2026

HY-World 1.5: A Systematic Framework for Interactive World Modeling with Real-Time Latency and Geometric Consistency

Python 1,112 90 Updated Jan 13, 2026

Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion

Python 300 3 Updated Dec 21, 2025

[Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset

Python 565 47 Updated Oct 29, 2025

Native Multimodal Models are World Learners

Python 1,451 56 Updated Dec 30, 2025

Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 349 48 Updated Jul 21, 2025

Official repository for BrickGPT, the first approach for generating physically stable toy brick models from text prompts.

Python 1,594 98 Updated Feb 7, 2026

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,753 65 Updated Jan 20, 2026

[NeurIPS'25 Spotlight] Boosting Generative Image Modeling via Joint Image-Feature Synthesis

Python 111 5 Updated Nov 3, 2025

Image Tokenizer Needs Post-Training

Python 24 2 Updated Oct 4, 2025

[ICLR 2026] Code for our paper "Next Visual Granularity Generation".

Python 49 1 Updated Jan 26, 2026

Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.

881 106 Updated Aug 27, 2025
Python 126 5 Updated Aug 10, 2025

[🚀 ICLR 2026 Oral] NextStep-1: SOTA Autoregressive Image Generation with Continuous Tokens. A research project developed by StepFun’s Multimodal Intelligence team.

Python 602 18 Updated Dec 25, 2025

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 7,241 422 Updated Dec 31, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,085 1,688 Updated Dec 17, 2025

Test-time Scaling for VAR models

Python 31 4 Updated Sep 19, 2025

Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"

Jupyter Notebook 172 4 Updated Dec 17, 2025

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 3,122 238 Updated Sep 12, 2025

Code and dataset link for "DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World"

122 2 Updated Oct 2, 2025

[NeurIPS 2025] Efficient Reasoning Vision Language Models

Python 448 29 Updated Sep 18, 2025
Python 81 Updated Oct 18, 2025

The true story of the hardship and darkness behind the development of the Noah Pangu large model.

11,389 1,341 Updated Jul 9, 2025

[NeurIPS 2025] Geometry Aware Operator Transformer As An Efficient And Accurate Neural Surrogate For PDEs On Arbitrary Domains

Python 73 19 Updated Oct 23, 2025

Train transformer language models with reinforcement learning.

Python 17,306 2,477 Updated Feb 7, 2026

An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation

Python 1,501 75 Updated Oct 16, 2025

🔥 Official impl. of "DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction"

Python 165 8 Updated Jul 10, 2025

[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers

Python 3,077 208 Updated Dec 21, 2025