- Peking University, China
- Yokohama, Japan
- https://github.com/CraKane
Stars
Generative Agents: Interactive Simulacra of Human Behavior
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
My continuously updated Machine Learning, Probabilistic Models and Deep Learning notes and demos (2000+ slides), with video links
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
FinRL®: Financial Reinforcement Learning. 🔥
This is the official implementation of Multi-Agent PPO (MAPPO).
PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.
An improved version of EOI (The Emergence of Individuality) on the StarCraft II task so_many_baneling.
Guidance for courses in the Department of Computer Science and Technology, Tsinghua University
PyTorch implementations of deep reinforcement learning algorithms and environments
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Scalable Multi-Agent RL Training School for Autonomous Driving
Python Multi-Agent Reinforcement Learning framework
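An intelligent patrol robot that actively follows a person at a set distance, moving forward and backward on its own. It recognizes human skeletons and gestures, and identifies people as friend or enemy by clothing color: red for enemies, blue for friends.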
An elegant PyTorch deep reinforcement learning library.
The data has 6 features and 1 output. The goal is regression using at least 3 machine learning methods and at least 1 neural network method.
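This project covers the topics and code implementations commonly tested in Machine Learning, Deep Learning, and NLP interviews, which are also the essential theoretical foundations for an algorithm engineer.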