Skip to content
View Txxx926's full-sized avatar

Block or report Txxx926

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

OpenClaw-RL: Train any agent simply by talking

Python 4,803 503 Updated Apr 8, 2026

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 5,070 663 Updated Apr 10, 2026

slime is an LLM post-training framework for RL Scaling.

Python 5,237 709 Updated Apr 9, 2026

Bash is all you need - A nano claude code–like 「agent harness」, built from 0 to 1

TypeScript 51,383 8,379 Updated Apr 7, 2026

DFlash: Block Diffusion for Flash Speculative Decoding

Python 987 62 Updated Apr 8, 2026

Official implementation of "FOCUS: DLLMs Know How to Tame Their Compute Bound".

Python 9 1 Updated Apr 5, 2026

Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond

Python 846 97 Updated Apr 7, 2026

cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

Python 2,014 130 Updated Apr 10, 2026

torchcomms: a modern PyTorch communications API

C++ 356 125 Updated Apr 11, 2026

The best ChatGPT that $100 can buy.

Python 51,574 6,837 Updated Mar 27, 2026

Fast low-bit matmul kernels in Triton

Python 443 33 Updated Apr 4, 2026

Pipeline parallelism for the minimalist

Python 40 1 Updated Aug 6, 2025
C++ 359 40 Updated Jan 28, 2026

[MobiCom'25] Elastic On-Device LLM Service.

Python 2 1 Updated Sep 7, 2025

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,336 915 Updated Apr 11, 2026

Allow torch tensor memory to be released and resumed later

Python 234 48 Updated Mar 10, 2026

Official implementation for DenseMixer: Improving MoE Post-Training with Precise Router Gradient

Python 66 8 Updated Aug 3, 2025

extensible collectives library in triton

Python 98 6 Updated Mar 31, 2025
Python 166 18 Updated Dec 27, 2024

📚 Modern C++ Tutorial: C++11/14/17/20 On the Fly | https://changkun.de/modern-cpp/

C++ 25,393 3,091 Updated Aug 17, 2024

Distributed Compiler based on Triton for Parallel Systems

Python 1,403 137 Updated Apr 10, 2026

A PyTorch native platform for training generative AI models

Python 5,221 780 Updated Apr 11, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,582 3,619 Updated Apr 10, 2026

Tile primitives for speedy kernels

Cuda 3,310 274 Updated Apr 8, 2026

CUDA Python: Performance meets Productivity

Cython 3,214 268 Updated Apr 11, 2026

Mamba SSM architecture

Python 17,923 1,687 Updated Apr 10, 2026

[ASPLOS'25] Towards End-to-End Optimization of LLM-based Applications with Ayo

Python 69 9 Updated Mar 11, 2026

[SenSys'24] PieBridge: Fast and Parameter-Efficient On-Device Training via Proxy Networks

Python 3 Updated Oct 4, 2024

DeepEP: an efficient expert-parallel communication library

Cuda 9,105 1,148 Updated Apr 9, 2026

Official Repo for Open-Reasoner-Zero

Python 2,089 119 Updated Jun 2, 2025
Next