Lists (1)
Sort Name ascending (A-Z)
Starred repositories
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)
Google Research
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。
FinRL®: Financial Reinforcement Learning. 🔥
强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
Using Low-rank adaptation to quickly fine-tune diffusion models.
Flax is a neural network library for JAX that is designed for flexibility.
Acceptance rates for the major AI conferences
Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023
Massively parallel rigidbody physics simulation on accelerator hardware.
TradeMaster is an open-source platform for quantitative trading empowered by reinforcement learning 🔥 ⚡ 🌈
Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow
机器学习方法习题解答,在线阅读地址:https://datawhalechina.github.io/statistical-learning-method-solutions-manual
主要存储Datawhale组队学习中“数据挖掘/机器学习”方向的资料。
Policy Gradient is all you need! A step-by-step tutorial for well-known PG methods.
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
1 million FPS multi-agent driving simulator
PyTorch implementation of Tacotron speech synthesis model.
[NeurIPS 2021] Official implementation of paper "Learning to Simulate Self-driven Particles System with Coordinated Policy Optimization".
VBD: Versatile Behavior Diffusion for Generalized Traffic Agent Simulation
Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"
Mobile App Tasks with Iterative Feedback (MoTIF): Addressing Task Feasibility in Interactive Visual Environments