Skip to content
View dotchen's full-sized avatar

Block or report dotchen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

WorldEngine: Towards the Era of Post-Training for Physical AI

Python 273 13 Updated Apr 24, 2026

An Extensible Deep Learning Library

Python 2,351 403 Updated Apr 16, 2026

Strategic research thinking agents for Claude Code — idea evaluation, project triage, and structured brainstorming. Helps you decide which papers to write, not just how to write them.

632 56 Updated Apr 13, 2026

AI agents running research on single-GPU nanochat training automatically

Python 77,630 11,323 Updated Mar 26, 2026

My learning notes for ML SYS.

Python 6,148 401 Updated Apr 23, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 365,998 75,038 Updated Apr 29, 2026

This repository contains the training code from paper "SpidR Learning Fast and Stable Linguistic Units for Spoken Language Models Without Supervision". SpidR is a self-supervised speech representat…

Python 57 6 Updated Apr 26, 2026

The missing tiktoken training code

Rust 444 47 Updated Jan 3, 2026

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,898 872 Updated Jun 10, 2024

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 20,998 3,762 Updated Apr 29, 2026

Post-training with Tinker

Python 3,181 402 Updated Apr 29, 2026

converter that creates three-dimensional models of the world from OpenStreetMap data

Java 747 140 Updated Feb 19, 2026
Jupyter Notebook 1,810 114 Updated Nov 5, 2025

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 21,088 1,796 Updated Mar 5, 2026

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 20,051 2,069 Updated Mar 27, 2026

[ICML-2025] We introduce Lie group Relative position Encodings (LieRE) that goes beyond RoPE in supporting n-dimensional inputs.

Python 14 1 Updated Aug 8, 2025

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 17,090 1,278 Updated Apr 27, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 78,510 16,223 Updated Apr 29, 2026
Python 598 67 Updated Sep 23, 2025

Simple RL training for reasoning

Python 3,850 289 Updated Dec 23, 2025

DeepSeek Coder: Let the Code Write Itself

Python 23,178 2,790 Updated Nov 11, 2025

Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.

Go 170,257 15,868 Updated Apr 29, 2026

A PyTorch native platform for training generative AI models

Python 5,279 801 Updated Apr 29, 2026

[ECCV 2024] Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting

Python 1,324 102 Updated Jul 4, 2025

Agile flight done right!

TeX 595 66 Updated Mar 7, 2023

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 1,313 71 Updated Mar 5, 2025

Modeling, training, eval, and inference code for OLMo

Python 6,488 752 Updated Nov 24, 2025

[ECCV 2024] DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving

Python 564 27 Updated Nov 29, 2024
Next