Skip to content
View zerlinwang's full-sized avatar
😃
Say hello
😃
Say hello

Block or report zerlinwang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

35 stars written in Jupyter Notebook
Clear filter

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 78,026 11,522 Updated Nov 6, 2025

TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)

Jupyter Notebook 43,754 14,843 Updated Jul 26, 2024

Google Research

Jupyter Notebook 36,671 8,232 Updated Oct 30, 2025

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Jupyter Notebook 21,684 6,165 Updated Jul 13, 2023

此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。

Jupyter Notebook 17,279 4,643 Updated Jun 21, 2022

FinRL®: Financial Reinforcement Learning. 🔥

Jupyter Notebook 13,066 3,004 Updated Oct 13, 2025

强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 12,928 2,148 Updated Sep 6, 2025

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Jupyter Notebook 11,666 1,711 Updated Apr 26, 2025

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Jupyter Notebook 10,817 1,390 Updated Nov 4, 2024

Using Low-rank adaptation to quickly fine-tune diffusion models.

Jupyter Notebook 7,466 493 Updated Mar 22, 2024

Flax is a neural network library for JAX that is designed for flexibility.

Jupyter Notebook 6,897 753 Updated Nov 6, 2025

Acceptance rates for the major AI conferences

Jupyter Notebook 4,655 312 Updated Sep 23, 2025

https://hrl.boyuai.com/

Jupyter Notebook 4,136 759 Updated Nov 22, 2022

Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023

Jupyter Notebook 3,007 650 Updated Oct 31, 2025

Massively parallel rigidbody physics simulation on accelerator hardware.

Jupyter Notebook 2,915 316 Updated Oct 30, 2025

TradeMaster is an open-source platform for quantitative trading empowered by reinforcement learning 🔥 ⚡ 🌈

Jupyter Notebook 2,193 428 Updated Jun 4, 2025

Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow

Jupyter Notebook 1,993 349 Updated Sep 26, 2025

机器学习方法习题解答,在线阅读地址:https://datawhalechina.github.io/statistical-learning-method-solutions-manual

Jupyter Notebook 1,960 245 Updated Sep 9, 2025

主要存储Datawhale组队学习中“数据挖掘/机器学习”方向的资料。

Jupyter Notebook 1,780 821 Updated Mar 16, 2022
Jupyter Notebook 1,486 98 Updated Nov 5, 2025

Policy Gradient is all you need! A step-by-step tutorial for well-known PG methods.

Jupyter Notebook 968 127 Updated May 30, 2025

JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.

Jupyter Notebook 727 72 Updated Oct 26, 2022

1 million FPS multi-agent driving simulator

Jupyter Notebook 539 71 Updated Oct 2, 2025

DrQ: Data regularized Q

Jupyter Notebook 417 54 Updated Jan 13, 2023

PyTorch implementation of Tacotron speech synthesis model.

Jupyter Notebook 308 79 Updated Jul 12, 2019

[NeurIPS 2021] Official implementation of paper "Learning to Simulate Self-driven Particles System with Coordinated Policy Optimization".

Jupyter Notebook 131 21 Updated Jan 29, 2024
Jupyter Notebook 109 6 Updated Feb 25, 2025

VBD: Versatile Behavior Diffusion for Generalized Traffic Agent Simulation

Jupyter Notebook 83 9 Updated Jan 2, 2025

Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"

Jupyter Notebook 75 2 Updated Nov 4, 2025

Mobile App Tasks with Iterative Feedback (MoTIF): Addressing Task Feasibility in Interactive Visual Environments

Jupyter Notebook 60 3 Updated Aug 19, 2024
Next