Starred repositories
Bayesian optimisation & Reinforcement Learning library developed by Huawei Noah's Ark Lab
Official implementation for the NeurIPS 2023 paper: "Reduced Policy Optimization for Continuous Control with Hard Constraints"
[AAAI 2023 Oral] Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition
2026最新悄咪咪收集的10000+个Telegram群合集,附全网最有趣好用的机器人BOT🤖【dianbaodaohang.com】
Master Federated Learning in 2 Hours—Run It on Your PC!
神领物流 黑马 物流项目 神领物流系统类似顺丰速运,是向C端用户提供快递服务的系统。竞品有:顺丰、中通、圆通、京东快递等。 项目产品主要有4端产品: - 用户端:基于微信小程序开发,外部客户使用,可以寄件、查询物流信息等。 - 快递员端:基于安卓开发的手机APP,公司内部的快递员使用,可以接收取派件任务等。 - 司机端:基于安卓开发的手机APP,公司内部的司机使用,可以接收运输任务、上报位置…
INFERLab / Gnu-RL
Forked from bingqingchen/Gnu-RLA precocial reinforcement learning solution for HVAC control
Space to perform my tests regarding my MSc research about energy and RL
This is the code for paper: Scalable Federated Multi-agent Architecture forNetworked Communication Scenarios
XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library
DSAC-v2; DSAC-T; DASC; Distributional Soft Actor-Critic
We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enhanced algorithm achieves 100% win rates on SMAC-V1 and superi…
[ICML'23] Official PyTorch Implementation of NA2Q, and a comprehensive benchmark based on pymarl
(ICML 2023) The official code for RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolution
Code for ICML2023 accepted paper: Complementary Attention for Multi-Agent Reinforcement Learning.
Multi-Agent Adversarial Inverse Reinforcement Learning, ICML 2019.
Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL?
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)
Official repository of the paper TransfQMix: Transformers for Leveraging the Graph Structure of Multi-Agent Reinforcement Learning Problems (AAMAS 2023)
Code for our paper: Scalable Multi-Agent Reinforcement Learning through Intelligent Information Aggregation