Stars
An efficient goal-conditioned reinforcement learning environment for fixed-wing UAV velocity vector control based on Gymnasium (ICLR2025).
Implementation of my CS336 assignment1
Recommendation Algorithm大规模推荐算法库,包含推荐系统经典及最新算法LR、Wide&Deep、DSSM、TDM、MIND、Word2Vec、Bert4Rec、DeepWalk、SSR、AITM,DSIN,SIGN,IPREC、GRU4Rec、Youtube_dnn、NCF、GNN、FM、FFM、DeepFM、DCN、DIN、DIEN、DLRM、MMOE、PLE、ESM…
A god-simulation sandbox game built on Godot 4 as a multi-agent AI social simulation system. In this virtual world, AI characters possess independent thinking and memory, capable of autonomous soci…
在act开源项目2个仿真任务的基础上新增了sim_cupboard任务(抽屉收纳)
UE5C++教程,UE5C++Tutorial, Unreal Engine 5 C++Tutorial, Unreal Engine 5 C++ 教程
Official Implementation of "NeuralPlane: An Efficiently Parallelizable Platform for Fixed-wing Aircraft Control with Reinforcement Learning"
Using reinforcement learning for optical computing
Fast and differentiable particle accelerator optics simulation for reinforcement learning and optimisation applications.
PyBullet Gymnasium environments for single and multi-agent reinforcement learning of quadcopter control
Code for the paper "Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents"
Developing godot envs for Air Combat and Multi-Uav Task allocation integrated with PettingZoo and Tianshou
A multi-agent reinforcement learning framework for optimizing coverage and connectivity in Space-Air-Ground integrated networks. This project simulates and trains intelligent agents to coordinate s…
Use interactive notebook to break down MiniMind code and learn from scratch.
Code for Fairness-Aware Offline Reinforcement Learning with Human Feedback (Fair-RLHF).
Patent : An anti-jamming communication method for unmanned cluster based on meta-reinforcement learning (一种基于元强化学习的无人集群抗干扰通信方法)
Transformer-PPO integrates the Decision Transformer architecture with Proximal Policy Optimization (PPO) to enhance reinforcement learning (RL) performance.
Transformer based forcasting of satellite orbital densities to inform control and decision making, MIT Arc Labs AI innovation challenge.
纯手工绘制 Transformer 架构图;Drawing the Transformer architecture diagram by hand
Project AirSim is Microsoft's evolution of AirSim, an advanced simulation platform for building, training, and testing autonomous systems in high-fidelity virtual environments
🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
Official codebase for GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning.
A curated list of Decision Transformer resources (continually updated)