Tomatoaac

Tomattoo Tomatoaac

UCAS

Stars

GongXudong / fly-craft

An efficient goal-conditioned reinforcement learning environment for fixed-wing UAV velocity vector control based on Gymnasium (ICLR2025).

Python 84 1 Updated Jul 2, 2025

donglinkang2021 / cs336-assignment1-basics

Implementation of my CS336 assignment1

Python 9 2 Updated Oct 31, 2025

PaddlePaddle / PaddleRec

Recommendation Algorithm大规模推荐算法库，包含推荐系统经典及最新算法LR、Wide&Deep、DSSM、TDM、MIND、Word2Vec、Bert4Rec、DeepWalk、SSR、AITM，DSIN，SIGN，IPREC、GRU4Rec、Youtube_dnn、NCF、GNN、FM、FFM、DeepFM、DCN、DIN、DIEN、DLRM、MMOE、PLE、ESM…

Python 4,042 653 Updated Apr 2, 2025

KsanaDock / Microverse

A god-simulation sandbox game built on Godot 4 as a multi-agent AI social simulation system. In this virtual world, AI characters possess independent thinking and memory, capable of autonomous soci…

GDScript 1,455 242 Updated Oct 19, 2025

HJX-exoskeleton / act-sim_cupboard

在act开源项目2个仿真任务的基础上新增了sim_cupboard任务（抽屉收纳）

Python 3 Updated May 21, 2025

HenryHuYu / DiffPhysDrone

Cuda 334 47 Updated Jun 25, 2025

Jiejie-UE / UE5CplusplusTutorial

UE5C++教程,UE5C++Tutorial, Unreal Engine 5 C++Tutorial, Unreal Engine 5 C++ 教程

C++ 43 2 Updated Mar 25, 2025

xuecy22 / NeuralPlane

Official Implementation of "NeuralPlane: An Efficiently Parallelizable Platform for Fixed-wing Aircraft Control with Reinforcement Learning"

Python 48 4 Updated Dec 17, 2024

xuecy22 / AeroPlanax

Python 8 Updated Jul 30, 2025

sul31man / RL-optics

Using reinforcement learning for optical computing

Python 2 Updated Jun 9, 2025

desy-ml / cheetah

Fast and differentiable particle accelerator optics simulation for reinforcement learning and optimisation applications.

Python 59 23 Updated Nov 5, 2025

utiasDSL / gym-pybullet-drones

PyBullet Gymnasium environments for single and multi-agent reinforcement learning of quadcopter control

Python 1,685 479 Updated Nov 2, 2025

boyu-ai / Hands-on-RL

https://hrl.boyuai.com/

Jupyter Notebook 4,133 759 Updated Nov 22, 2022

openai / neural-mmo

Code for the paper "Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents"

Python 1,644 270 Updated Jul 21, 2023

andrekuros / B-ACE

Developing godot envs for Air Combat and Multi-Uav Task allocation integrated with PettingZoo and Tianshou

GDScript 32 6 Updated Jun 26, 2025

Happymic / SkyNetRL-Multi-Agent-Reinforcement-Learning-for-Space-Air-Ground-Networks

A multi-agent reinforcement learning framework for optimizing coverage and connectivity in Space-Air-Ground integrated networks. This project simulates and trains intelligent agents to coordinate s…

Python 39 4 Updated Oct 25, 2025

KelvinQiu802 / llm-mcp-rag

LLM + MCP + RAG = Magic

TypeScript 408 61 Updated Apr 6, 2025

Nijikadesu / breakdown-minimind

Use interactive notebook to break down MiniMind code and learn from scratch.

Jupyter Notebook 102 17 Updated Mar 28, 2025

Harshit052610 / fair-rlhf-code

Code for Fairness-Aware Offline Reinforcement Learning with Human Feedback (Fair-RLHF).

Jupyter Notebook 1 Updated Aug 31, 2025

d3ac / MetaRL-for-UAV-Anti-jamming

Patent : An anti-jamming communication method for unmanned cluster based on meta-reinforcement learning (一种基于元强化学习的无人集群抗干扰通信方法)

Jupyter Notebook 20 1 Updated Oct 29, 2024

mtr26 / Transformer-PPO

Transformer-PPO integrates the Decision Transformer architecture with Proximal Policy Optimization (PPO) to enhance reinforcement learning (RL) performance.

Python 8 Updated Apr 2, 2025

Ardrito / AI_challenge

Transformer based forcasting of satellite orbital densities to inform control and decision making, MIT Arc Labs AI innovation challenge.

Jupyter Notebook 1 Updated May 8, 2025

An-Jhon / Hand-Drawn-Transformer

纯手工绘制 Transformer 架构图；Drawing the Transformer architecture diagram by hand

15 Updated Jul 29, 2025

iamaisim / ProjectAirSim

Project AirSim is Microsoft's evolution of AirSim, an advanced simulation platform for building, training, and testing autonomous systems in high-fidelity virtual environments

C++ 326 43 Updated Oct 31, 2025

jingyaogong / minimind-v

🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM！🌏 Train a 26M-parameter VLM from scratch in just 1 hours!

Python 5,205 545 Updated Oct 30, 2025

jingyaogong / minimind

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT！🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 32,501 3,758 Updated Nov 2, 2025

Jaewoopudding / GTA

Official codebase for GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning.

Python 26 2 Updated Nov 12, 2024

huapu-kaf / light

光电赛强化学习

Python 3 Updated Mar 28, 2025

deameW / AirSim212

强化学习 airsim场景

Python 3 Updated Apr 1, 2025

opendilab / awesome-decision-transformer

A curated list of Decision Transformer resources (continually updated)

827 35 Updated Sep 12, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly