Skip to content
View dotchen's full-sized avatar

Block or report dotchen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,788 868 Updated Jun 10, 2024

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,583 2,846 Updated Dec 18, 2025

Post-training with Tinker

Python 2,566 242 Updated Dec 18, 2025

converter that creates three-dimensional models of the world from OpenStreetMap data

Java 663 135 Updated Nov 19, 2025
Jupyter Notebook 1,574 100 Updated Nov 5, 2025

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 19,973 1,667 Updated Nov 26, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,433 1,988 Updated Nov 1, 2025

[ICML-2025] We introduce Lie group Relative position Encodings (LieRE) that goes beyond RoPE in supporting n-dimensional inputs.

Python 13 1 Updated Aug 8, 2025

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 16,220 1,184 Updated Dec 18, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 65,705 12,044 Updated Dec 18, 2025
Python 564 56 Updated Sep 23, 2025

Simple RL training for reasoning

Python 3,808 281 Updated Aug 3, 2025

DeepSeek Coder: Let the Code Write Itself

Python 22,507 2,684 Updated Nov 11, 2025

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 157,848 13,952 Updated Dec 18, 2025

A PyTorch native platform for training generative AI models

Python 4,854 644 Updated Dec 18, 2025

[ECCV 2024] Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting

Python 1,236 87 Updated Jul 4, 2025

Agile flight done right!

TeX 550 61 Updated Mar 7, 2023

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 1,126 54 Updated Mar 5, 2025

Modeling, training, eval, and inference code for OLMo

Python 6,230 689 Updated Nov 24, 2025

[ECCV 2024] DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving

Python 516 25 Updated Nov 29, 2024

Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Python 19,942 2,823 Updated Oct 17, 2025

TrafficBots: Towards World Models for Autonomous Driving Simulation and Motion Prediction. ICRA 2023. You may also want to check out the updated version: https://github.com/zhejz/TrafficBotsV1.5

Python 71 10 Updated Sep 29, 2023

A JAX-based simulator for autonomous driving research.

Python 1,012 123 Updated Oct 23, 2025

PyTorch implementation for the paper "Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving"

Python 566 49 Updated Sep 26, 2024

[ICCV 2023] I-ViT: Integer-only Quantization for Efficient Vision Transformer Inference

Python 193 27 Updated Sep 2, 2024

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 7,158 395 Updated Jul 11, 2024

[arXiv:2309.16669] Code release for "Training a Large Video Model on a Single Machine in a Day"

Python 137 11 Updated Aug 23, 2025

[ICRA'2024] Rethinking Imitation-based Planner for Autonomous Driving

Python 344 27 Updated Jul 11, 2024
Next