Skip to content
View cherhh's full-sized avatar

Block or report cherhh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

DreamX-World: A General-Purpose Interactive World Model

Python 480 27 Updated Jun 16, 2026
Python 12 2 Updated Jun 15, 2026
Python 3 Updated Jun 14, 2026
Python 335 29 Updated Jun 15, 2026

[ICML2026] Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization

Python 54 4 Updated Jun 4, 2026

verl Zero-Mismatch Dense/MoE HuggingFace Rollout

Python 53 5 Updated Jun 11, 2026

Muon in Int8 Precision Made Possible

Python 19 1 Updated Jun 18, 2026

Agent-friendly GPU profile-query CLI

Rust 85 2 Updated Jun 19, 2026

A Claude Code skill for creating, compiling, reviewing, and polishing academic Beamer LaTeX presentations. Full lifecycle workflow with quality scoring, pedagogical review, TikZ audit, and more.

TeX 275 13 Updated Jun 15, 2026

[CVPR 2026] CoMo: Learning Continuous Latent Motion from Internet Videos for Scalable Robot Learning

Python 12 Updated May 15, 2026

InfiniCCL is a unified, cross-platform collective communication library designed for heterogeneous accelerator environments.

C++ 13 3 Updated Jun 17, 2026

high-performance inference and serving library for interactive autoregressive video and world models

Python 337 22 Updated Jun 20, 2026

Official Codebase for "DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos" (ICML 2026)

Python 947 64 Updated Mar 21, 2026

Model compression toolkit engineered for enhanced usability, comprehensiveness, and efficiency.

Python 1,324 153 Updated Jun 20, 2026

Official Code of NAVA: Native Audio-Visual Alignment for Generation.

Python 199 22 Updated Jun 15, 2026
Python 16 1 Updated May 27, 2026

Official implementation of “Domino: Decoupling Causal Modeling from Autoregressive Drafting in Speculative Decoding”.

Python 69 3 Updated Jun 10, 2026
Python 2 Updated May 26, 2026

mKernel: fast multi-node, multi-GPU fused kernels

Cuda 239 22 Updated Jun 8, 2026

StreamDiffusion, Live Stream APP

Python 499 57 Updated May 19, 2026

CODA: Rewriting Transformer Blocks as GEMM-Epilogue Programs

Python 214 22 Updated Jun 20, 2026
Python 257 31 Updated Jun 9, 2026

Warp-as-History: Generalizable Camera-Controlled Video Generation from One Training Video

Python 216 9 Updated May 30, 2026

A Minimal and Elegant Framework & Tutorial for Real-Time Interactive World Models

Python 612 11 Updated Jun 15, 2026

Official implementation of Paper "System-Aware 4-Bit KV-Cache Quantization for Real-World LLM Serving"

Shell 27 3 Updated Apr 17, 2026

CARE: Covariance-Aware and Rank-Enhanced Decomposition for Enabling Multi-Head Latent Attention

Python 7 1 Updated May 18, 2026

Triton kernels and PyTorch ops for Block Attention Residuals (AttnRes)

Python 83 6 Updated May 29, 2026

Hadamard transformation kernels written by cutedsl

4 Updated May 20, 2026

Dataflow-Oriented Reinforcement Learning for (Multi-)Agentic LLMs

Python 89 15 Updated Jun 19, 2026
Next