Skip to content
View 0Pinky0's full-sized avatar

Block or report 0Pinky0

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
23 stars written in Jupyter Notebook
Clear filter

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Jupyter Notebook 10,817 1,390 Updated Nov 4, 2024

Book_3_《数学要素》 | 鸢尾花书:从加减乘除到机器学习;上架;欢迎继续纠错,纠错多的同学还会有赠书!

Jupyter Notebook 7,189 1,281 Updated Jan 26, 2025

Flax is a neural network library for JAX that is designed for flexibility.

Jupyter Notebook 6,896 753 Updated Nov 6, 2025

Book_5_《统计至简》 | 鸢尾花书:从加减乘除到机器学习;上架!

Jupyter Notebook 3,472 719 Updated Feb 6, 2025

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Jupyter Notebook 2,951 194 Updated Nov 3, 2025

Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow

Jupyter Notebook 1,993 349 Updated Sep 26, 2025

机器学习方法习题解答,在线阅读地址:https://datawhalechina.github.io/statistical-learning-method-solutions-manual

Jupyter Notebook 1,960 245 Updated Sep 9, 2025

PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..

Jupyter Notebook 1,302 140 Updated Mar 13, 2025

RL implementations

Jupyter Notebook 1,229 187 Updated Nov 5, 2025

Policy Gradient is all you need! A step-by-step tutorial for well-known PG methods.

Jupyter Notebook 968 127 Updated May 30, 2025
Jupyter Notebook 920 108 Updated Jun 27, 2024

《机器学习》(西瓜书)代码实战

Jupyter Notebook 911 186 Updated May 7, 2025

CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making

Jupyter Notebook 654 63 Updated Apr 20, 2025

Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math

Jupyter Notebook 447 141 Updated Apr 1, 2021

cs224w(图机器学习)2021冬季课程的colab

Jupyter Notebook 245 41 Updated Jul 9, 2021
Jupyter Notebook 243 23 Updated Feb 18, 2025

basic algorithms of reinforcement learning

Jupyter Notebook 214 55 Updated Aug 23, 2023

Here lies all the exercises I implement and share in my website

Jupyter Notebook 212 156 Updated Jun 17, 2024

UCB EECS126 : probability theory and random processes.

Jupyter Notebook 147 44 Updated Jul 14, 2022
Jupyter Notebook 91 28 Updated Dec 9, 2020

example on how to setup pybind11 extension builds with poetry

Jupyter Notebook 33 6 Updated Oct 6, 2022

TLoL - League of Legends Deep Learning AI (Research and Development)

Jupyter Notebook 32 3 Updated Dec 22, 2023