Skip to content
View dotchen's full-sized avatar

Block or report dotchen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

WorldEngine: Towards the Era of Post-Training for Physical AI

Python 275 14 Updated Apr 24, 2026

An Extensible Deep Learning Library

Python 2,351 403 Updated Apr 16, 2026

Strategic research thinking agents for Claude Code — idea evaluation, project triage, and structured brainstorming. Helps you decide which papers to write, not just how to write them.

633 56 Updated Apr 13, 2026

AI agents running research on single-GPU nanochat training automatically

Python 78,040 11,379 Updated Mar 26, 2026

My learning notes for ML SYS.

Python 6,159 401 Updated Apr 23, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 366,665 75,276 Updated Apr 30, 2026

This repository contains the training code from paper "SpidR Learning Fast and Stable Linguistic Units for Spoken Language Models Without Supervision". SpidR is a self-supervised speech representat…

Python 57 6 Updated Apr 26, 2026

The missing tiktoken training code

Rust 445 48 Updated Jan 3, 2026

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,899 873 Updated Jun 10, 2024

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 21,031 3,777 Updated Apr 30, 2026

Post-training with Tinker

Python 3,189 404 Updated Apr 30, 2026

converter that creates three-dimensional models of the world from OpenStreetMap data

Java 747 140 Updated Feb 19, 2026
Jupyter Notebook 1,814 115 Updated Nov 5, 2025

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 21,097 1,796 Updated Mar 5, 2026

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 20,060 2,072 Updated Mar 27, 2026

[ICML-2025] We introduce Lie group Relative position Encodings (LieRE) that goes beyond RoPE in supporting n-dimensional inputs.

Python 14 1 Updated Aug 8, 2025

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 17,098 1,279 Updated Apr 30, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 78,674 16,280 Updated Apr 30, 2026
Python 598 67 Updated Sep 23, 2025

Simple RL training for reasoning

Python 3,851 289 Updated Dec 23, 2025

DeepSeek Coder: Let the Code Write Itself

Python 23,198 2,795 Updated Nov 11, 2025

Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.

Go 170,387 15,897 Updated Apr 30, 2026

A PyTorch native platform for training generative AI models

Python 5,286 802 Updated Apr 30, 2026

[ECCV 2024] Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting

Python 1,324 101 Updated Jul 4, 2025

Agile flight done right!

TeX 595 66 Updated Mar 7, 2023

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 1,315 71 Updated Mar 5, 2025

Modeling, training, eval, and inference code for OLMo

Python 6,489 752 Updated Nov 24, 2025

[ECCV 2024] DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving

Python 565 27 Updated Nov 29, 2024
Next