Skip to content
View qsh-zh's full-sized avatar

Highlights

  • Pro

Block or report qsh-zh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

"ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)"

Python 1,626 277 Updated Dec 15, 2025

爬抖音,爬取别人的美好生活

Python 804 263 Updated Dec 8, 2022

nanoGRPO is a lightweight implementation of Group Relative Policy Optimization (GRPO)

Python 138 9 Updated May 8, 2025

A prototype implementation of the "dataset as a queue" pattern for processing web pages into interleaved image/text content.

Python 29 Updated Nov 16, 2025

Ongoing research training transformer models at scale

Python 14,714 3,414 Updated Dec 25, 2025

Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support

Python 215 37 Updated Dec 25, 2025

Multilingual Document Layout Parsing in a Single Vision-Language Model

Python 5,933 579 Updated Oct 31, 2025

This repo contains the code for 1D tokenizer and generator

Jupyter Notebook 1,086 59 Updated Mar 20, 2025

This is the official repo for the paper "LongCat-Flash-Omni Technical Report"

Python 447 25 Updated Dec 15, 2025

Simple IO APIs with pluggable storage backends and rich format handlers.

Python 4 Updated Oct 31, 2025

Official Implementation of the ICCV 2023 paper: Perpetual Humanoid Control for Real-time Simulated Avatars

Python 1,138 112 Updated Aug 21, 2025
Python 153 14 Updated Dec 27, 2024

Tiny AutoEncoder for Hunyuan Video (and other video models)

Python 256 5 Updated Dec 17, 2025

Cosmos-RL is a flexible and scalable Reinforcement Learning framework specialized for Physical AI applications.

Python 248 37 Updated Dec 25, 2025

RES: Refined Exponential Solver. https://arxiv.org/abs/2308.02157

Python 1 Updated Aug 24, 2025
Python 874 64 Updated Dec 13, 2025

Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"

Python 496 14 Updated Sep 2, 2024

[NeurIPS 2025] Radial Attention: O(nlogn) Sparse Attention with Energy Decay for Long Video Generation

Python 567 32 Updated Nov 11, 2025

[ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention

Python 611 32 Updated Dec 9, 2025

Best practices for training DeepSeek, Mixtral, Qwen and other MoE models using Megatron Core.

Python 143 29 Updated Dec 19, 2025

Dion optimizer algorithm

Python 410 42 Updated Dec 23, 2025

Evaluation harness for diffusion world models

TypeScript 13 4 Updated Aug 13, 2025

A place to store reusable transformer components of my own creation or found on the interwebs

Python 63 10 Updated Dec 13, 2025

kernels, of the mega variety

Python 634 34 Updated Sep 28, 2025

Cosmos-Predict2 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world models for downstream applications.

Python 691 95 Updated Oct 29, 2025

open-source coding LLM for software engineering tasks

Python 1,078 129 Updated Sep 30, 2025

[ICML 2025 Spotlight] Direct Discriminative Optimization: Supercharging Diffusion/Autoregressive with GAN-type Discrimination

Python 109 3 Updated Jul 31, 2025

The official implementation of CVPR'25 Oral paper "Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise"

Python 1,055 49 Updated Oct 13, 2025

Quickly rewrite git repository history (filter-branch replacement)

Python 11,387 900 Updated Dec 2, 2025
Next