ShuaibinLi

🎯

Focusing

Happy ShuaibinLi

🎯

Focusing

17 followers · 7 following

Achievements

Stars

ShuaibinLi / RL_CARLA

Train auto_car in CARLA simulator with RL algorithms(SAC).

Python 114 12 Updated Oct 11, 2025

alibaba / Pai-Megatron-Patch

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 1,577 229 Updated Dec 15, 2025

Lightning-AI / pytorch-lightning

Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

Python 31,181 3,736 Updated Jun 10, 2026

pengzhangzhi / Open-dLLM

Open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.

Python 630 48 Updated May 31, 2026

RadicalNumerics / RND1

RND1: Scaling Diffusion Language Models

Python 183 12 Updated Feb 22, 2026

anthropics / claude-cookbooks

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Jupyter Notebook 45,306 5,260 Updated Jun 9, 2026

rail-berkeley / rlkit

Collection of reinforcement learning algorithms

Python 2,907 571 Updated Jun 17, 2024

ShuaibinLi / ESBox

Python 4 Updated Oct 11, 2025

fastai / course22-web

Website for Practical Deep Learning for Coders 2022

Jupyter Notebook 96 28 Updated Jun 24, 2024

karpathy / makemore

An autoregressive character-level language model for making more things

Python 4,010 987 Updated Jun 4, 2024

verl-project / verl

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 21,912 4,064 Updated Jun 10, 2026

a-m-team / a-m-models

a-m-team's exploration in large language modeling

196 3 Updated May 29, 2025

glorgao / SelectiveDPO

Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples

Python 47 1 Updated Jul 16, 2025

rasmusgreve / MCTSMario

Monte Carlo Tree Search Mario AI

Java 31 12 Updated Dec 28, 2013

ggml-org / llama.cpp

LLM inference in C/C++

C++ 116,099 19,478 Updated Jun 11, 2026

sgl-project / sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 28,913 6,470 Updated Jun 11, 2026

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 96,993 14,837 Updated Jun 2, 2026

hcengineering / platform

Huly — All-in-One Project Management Platform (alternative to Linear, Jira, Slack, Notion, Motion)

TypeScript 26,173 1,934 Updated Jun 10, 2026

kenjihiranabe / The-Art-of-Linear-Algebra

Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"

PostScript 21,556 2,565 Updated Jun 30, 2025

gradio-app / gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 42,891 3,486 Updated Jun 11, 2026

magnusja / ppo

Forked from pat-coady/trpo

Proximal Policy Optimization with TensorFlow and OpenAI Gym

Jupyter Notebook 19 5 Updated Mar 31, 2018

benchmarking-rl / PARL-experiments

Experiments results of PARL

5 5 Updated Jul 5, 2023

ShuaibinLi / pygame-games

Make Fantastic games with pygame！

Python 2 Updated May 7, 2022

ljzycmd / SimDeblur

Simple framework for image and video deblurring, implemented by PyTorch

Python 346 40 Updated Dec 20, 2023

tuna / thuthesis

LaTeX Thesis Template for Tsinghua University

TeX 5,385 1,162 Updated May 27, 2026

int8 / monte-carlo-tree-search

Monte carlo tree search in python

Python 629 172 Updated Jul 2, 2022

haroldsultan / MCTS

Python Implementations of Monte Carlo Tree Search

Python 326 88 Updated Aug 20, 2021

AppliedDataSciencePartners / DeepReinforcementLearning

A replica of the AlphaZero methodology for deep reinforcement learning in Python

Jupyter Notebook 2,032 750 Updated Nov 21, 2022

openai / spinningup

An educational resource to help anyone learn deep reinforcement learning.

Python 11,806 2,453 Updated Aug 5, 2024

ShangtongZhang / reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

Python 14,676 4,965 Updated Aug 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Happy ShuaibinLi

Achievements

Achievements

Block or report ShuaibinLi

Stars

ShuaibinLi / RL_CARLA

alibaba / Pai-Megatron-Patch

Lightning-AI / pytorch-lightning

pengzhangzhi / Open-dLLM

RadicalNumerics / RND1

anthropics / claude-cookbooks

rail-berkeley / rlkit

ShuaibinLi / ESBox

fastai / course22-web

karpathy / makemore

verl-project / verl

a-m-team / a-m-models

glorgao / SelectiveDPO

rasmusgreve / MCTSMario

ggml-org / llama.cpp

sgl-project / sglang

rasbt / LLMs-from-scratch

hcengineering / platform

kenjihiranabe / The-Art-of-Linear-Algebra

gradio-app / gradio

magnusja / ppo

benchmarking-rl / PARL-experiments

ShuaibinLi / pygame-games

ljzycmd / SimDeblur

tuna / thuthesis

int8 / monte-carlo-tree-search

haroldsultan / MCTS

AppliedDataSciencePartners / DeepReinforcementLearning

openai / spinningup

ShangtongZhang / reinforcement-learning-an-introduction