Train auto_car in the CARLA simulator with RL algorithms (SAC).

Python 114 12 Updated Oct 11, 2025

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 1,577 229 Updated Dec 15, 2025

Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

Python 31,181 3,736 Updated Jun 10, 2026

Open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.

Python 630 48 Updated May 31, 2026

RND1: Scaling Diffusion Language Models

Python 183 12 Updated Feb 22, 2026

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Jupyter Notebook 45,306 5,260 Updated Jun 9, 2026

Collection of reinforcement learning algorithms

Python 2,907 571 Updated Jun 17, 2024
Python 4 Updated Oct 11, 2025

Website for Practical Deep Learning for Coders 2022

Jupyter Notebook 96 28 Updated Jun 24, 2024

An autoregressive character-level language model for making more things

Python 4,010 987 Updated Jun 4, 2024

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 21,912 4,064 Updated Jun 10, 2026

a-m-team's exploration in large language modeling

196 3 Updated May 29, 2025

Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples

Python 47 1 Updated Jul 16, 2025

Monte Carlo Tree Search Mario AI

Java 31 12 Updated Dec 28, 2013

LLM inference in C/C++

C++ 116,099 19,478 Updated Jun 11, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 28,913 6,470 Updated Jun 11, 2026

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 96,993 14,837 Updated Jun 2, 2026

Huly — All-in-One Project Management Platform (alternative to Linear, Jira, Slack, Notion, Motion)

TypeScript 26,173 1,934 Updated Jun 10, 2026

Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"

PostScript 21,556 2,565 Updated Jun 30, 2025

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 42,891 3,486 Updated Jun 11, 2026

Proximal Policy Optimization with TensorFlow and OpenAI Gym

Jupyter Notebook 19 5 Updated Mar 31, 2018

Experiment results of PARL

5 5 Updated Jul 5, 2023

Make Fantastic games with pygame!

Python 2 Updated May 7, 2022

Simple framework for image and video deblurring, implemented in PyTorch

Python 346 40 Updated Dec 20, 2023

LaTeX Thesis Template for Tsinghua University

TeX 5,385 1,162 Updated May 27, 2026

Monte Carlo tree search in Python

Python 629 172 Updated Jul 2, 2022

Python Implementations of Monte Carlo Tree Search

Python 326 88 Updated Aug 20, 2021

A replica of the AlphaZero methodology for deep reinforcement learning in Python

Jupyter Notebook 2,032 750 Updated Nov 21, 2022

An educational resource to help anyone learn deep reinforcement learning.

Python 11,806 2,453 Updated Aug 5, 2024

Python Implementation of Reinforcement Learning: An Introduction

Python 14,676 4,965 Updated Aug 9, 2024