Kiv Chen KivenChen

🎯

Focusing

@cmusv. Broke things @risingwavelabs @bytedance @aliyun @hyperledger and 2 others.

234 followers · 499 following

Mountain View, CA

Achievements

x2 x2

Starred repositories

kyegomez / OpenMythos

A theoretical reconstruction of the Claude Mythos architecture, built from first principles using the available research literature.

Python 13,810 3,115 Updated May 23, 2026

state-spaces / mamba

Mamba SSM architecture

Python 18,436 1,755 Updated Jun 9, 2026

NVIDIA / cutile-python

cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

Python 2,069 140 Updated Jun 13, 2026

xdit-project / xDiT

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 2,634 321 Updated Jun 8, 2026

QwenLM / Qwen-Image

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 7,999 503 Updated Feb 10, 2026

ai-dynamo / aiconfigurator

Offline optimization of your disaggregated Dynamo graph

Python 335 126 Updated Jun 13, 2026

vllm-project / aibrix

Cost-efficient and pluggable Infrastructure components for GenAI inference

Go 4,875 600 Updated Jun 13, 2026

deepseek-ai / DeepSeek-OCR

Contexts Optical Compression

Python 23,288 2,152 Updated Jan 27, 2026

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 54,987 7,492 Updated May 5, 2026

ByteDance-Seed / VeOmni

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 2,013 212 Updated Jun 12, 2026

fzyzcjy / torch_memory_saver

Allow torch tensor memory to be released and resumed later

Python 250 58 Updated May 16, 2026

sgl-project / SpecForge

Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

Python 888 252 Updated Jun 13, 2026

astral-sh / uv

An extremely fast Python package and project manager, written in Rust.

Rust 86,336 3,197 Updated Jun 13, 2026

prefix-dev / pixi

Powerful system-level package manager for Linux, macOS and Windows written in Rust – building on top of the Conda ecosystem.

Rust 7,279 530 Updated Jun 13, 2026

deepseek-ai / DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 7,373 1,044 Updated Jun 4, 2026

TIGER-AI-Lab / verl-tool

A version of verl to support diverse tool use [TMLR 2026]

Python 997 83 Updated Jun 8, 2026

QwenLM / Qwen3-Coder

Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team.

Python 16,616 1,204 Updated Mar 24, 2026

Danau5tin / calculator_agent_rl

Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a 62% absolute increase in evaluation accuracy.

Python 72 7 Updated May 5, 2025

yaof20 / Flash-RL

Implementation of FP8/INT8 rollout for RL training without performance drop.

Python 303 23 Updated Nov 7, 2025

Tencent / Wechat-YATT

Python 71 7 Updated Jun 8, 2026

NVIDIA / dcgm-exporter

NVIDIA GPU metrics exporter for Prometheus leveraging DCGM

Go 1,764 298 Updated May 12, 2026

KivenChen / rayssh

Python 2 Updated Nov 12, 2025

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 6,109 893 Updated Jun 13, 2026

alibaba / Pai-Megatron-Patch

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 1,577 229 Updated Dec 15, 2025

smtg-ai / claude-squad

Manage multiple AI terminal agents like Claude Code, Codex, OpenCode, and Amp.

Go 7,798 553 Updated May 18, 2026

pytorch / FBGEMM

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

C++ 1,569 755 Updated Jun 13, 2026

ByteDance-Seed / Bagel

Open-source unified multimodal model

Python 6,007 532 Updated May 4, 2026

NVlabs / COAT

[ICLR 2025] COAT: Compressing Optimizer States and Activation for Memory-Efficient FP8 Training

Python 265 25 Updated Aug 9, 2025

mlc-ai / xgrammar

Fast, Flexible and Portable Structured Generation

C++ 1,739 153 Updated Jun 11, 2026

ClickHouse / adsb.exposed

Interactive visualization and analytics on ADS-B data with ClickHouse

JavaScript 446 14 Updated Mar 17, 2026

Starred topics

PostgreSQL