Skip to content
View RITCHIEHuang's full-sized avatar

Block or report RITCHIEHuang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

My learning notes/codes for ML SYS.

Python 3,830 232 Updated Oct 6, 2025

Nano vLLM

Python 7,009 891 Updated Aug 31, 2025

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 7,924 787 Updated Sep 19, 2025

Efficient Triton Kernels for LLM Training

Python 5,731 413 Updated Oct 10, 2025

Code repository for the paper - "Matryoshka Representation Learning"

Jupyter Notebook 566 34 Updated Feb 19, 2024

An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to inference, as well as some practical experiences and conclusions.…

Python 764 94 Updated Mar 13, 2025
Python 882 90 Updated Oct 9, 2025

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 78,524 8,517 Updated Oct 10, 2025

🧑‍🚀 全世界最好的LLM资料总结(语音视频生成、Agent、辅助编程、数据处理、模型训练、模型推理、o1 模型、MCP、小语言模型、视觉语言模型) | Summary of the world's best LLM resources.

6,307 617 Updated Oct 9, 2025

Text-audio foundation model from Boson AI

Python 7,419 537 Updated Sep 15, 2025

Tiny-FSDP, a minimalistic re-implementation of the PyTorch FSDP

Python 84 7 Updated Aug 20, 2025

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 12,221 2,252 Updated Sep 24, 2025

slime is an LLM post-training framework for RL Scaling.

Python 2,091 200 Updated Oct 10, 2025

PyTorch bindings for CUTLASS grouped GEMM.

Cuda 151 47 Updated Aug 28, 2025

Collect the awesome works evolved around reasoning models like O1/R1 in visual domain

41 1 Updated Jul 21, 2025

量化代码

Python 315 96 Updated Aug 10, 2025

📚 从零开始的大语言模型原理与实践教程

Jupyter Notebook 18,931 1,634 Updated Oct 7, 2025
Python 855 50 Updated Sep 3, 2025

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’

Jupyter Notebook 2,213 95 Updated Jul 22, 2025

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Python 17,402 2,260 Updated Oct 5, 2025

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python 15,805 1,265 Updated Jan 18, 2025

Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model

Python 1,667 171 Updated Oct 4, 2025

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,574 68 Updated May 11, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 14,141 2,519 Updated Oct 10, 2025

Awesome RL-based LLM Reasoning

639 34 Updated Jul 19, 2025

A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.

Python 2,868 304 Updated Mar 10, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,784 710 Updated Oct 9, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,921 285 Updated May 15, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 8,588 947 Updated Oct 10, 2025

R1-onevision, a visual language model capable of deep CoT reasoning.

Python 568 16 Updated Apr 13, 2025
Next