Skip to content
View Tomatoaac's full-sized avatar

Block or report Tomatoaac

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An efficient goal-conditioned reinforcement learning environment for fixed-wing UAV velocity vector control based on Gymnasium (ICLR2025).

Python 84 1 Updated Jul 2, 2025

Implementation of my CS336 assignment1

Python 9 2 Updated Oct 31, 2025

Recommendation Algorithm大规模推荐算法库,包含推荐系统经典及最新算法LR、Wide&Deep、DSSM、TDM、MIND、Word2Vec、Bert4Rec、DeepWalk、SSR、AITM,DSIN,SIGN,IPREC、GRU4Rec、Youtube_dnn、NCF、GNN、FM、FFM、DeepFM、DCN、DIN、DIEN、DLRM、MMOE、PLE、ESM…

Python 4,042 653 Updated Apr 2, 2025

A god-simulation sandbox game built on Godot 4 as a multi-agent AI social simulation system. In this virtual world, AI characters possess independent thinking and memory, capable of autonomous soci…

GDScript 1,455 242 Updated Oct 19, 2025

在act开源项目2个仿真任务的基础上新增了sim_cupboard任务(抽屉收纳)

Python 3 Updated May 21, 2025
Cuda 334 47 Updated Jun 25, 2025

UE5C++教程,UE5C++Tutorial, Unreal Engine 5 C++Tutorial, Unreal Engine 5 C++ 教程

C++ 43 2 Updated Mar 25, 2025

Official Implementation of "NeuralPlane: An Efficiently Parallelizable Platform for Fixed-wing Aircraft Control with Reinforcement Learning"

Python 48 4 Updated Dec 17, 2024
Python 8 Updated Jul 30, 2025

Using reinforcement learning for optical computing

Python 2 Updated Jun 9, 2025

Fast and differentiable particle accelerator optics simulation for reinforcement learning and optimisation applications.

Python 59 23 Updated Nov 5, 2025

PyBullet Gymnasium environments for single and multi-agent reinforcement learning of quadcopter control

Python 1,685 479 Updated Nov 2, 2025

https://hrl.boyuai.com/

Jupyter Notebook 4,133 759 Updated Nov 22, 2022

Code for the paper "Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents"

Python 1,644 270 Updated Jul 21, 2023

Developing godot envs for Air Combat and Multi-Uav Task allocation integrated with PettingZoo and Tianshou

GDScript 32 6 Updated Jun 26, 2025

A multi-agent reinforcement learning framework for optimizing coverage and connectivity in Space-Air-Ground integrated networks. This project simulates and trains intelligent agents to coordinate s…

Python 39 4 Updated Oct 25, 2025

LLM + MCP + RAG = Magic

TypeScript 408 61 Updated Apr 6, 2025

Use interactive notebook to break down MiniMind code and learn from scratch.

Jupyter Notebook 102 17 Updated Mar 28, 2025

Code for Fairness-Aware Offline Reinforcement Learning with Human Feedback (Fair-RLHF).

Jupyter Notebook 1 Updated Aug 31, 2025

Patent : An anti-jamming communication method for unmanned cluster based on meta-reinforcement learning (一种基于元强化学习的无人集群抗干扰通信方法)

Jupyter Notebook 20 1 Updated Oct 29, 2024

Transformer-PPO integrates the Decision Transformer architecture with Proximal Policy Optimization (PPO) to enhance reinforcement learning (RL) performance.

Python 8 Updated Apr 2, 2025

Transformer based forcasting of satellite orbital densities to inform control and decision making, MIT Arc Labs AI innovation challenge.

Jupyter Notebook 1 Updated May 8, 2025

纯手工绘制 Transformer 架构图;Drawing the Transformer architecture diagram by hand

15 Updated Jul 29, 2025

Project AirSim is Microsoft's evolution of AirSim, an advanced simulation platform for building, training, and testing autonomous systems in high-fidelity virtual environments

C++ 326 43 Updated Oct 31, 2025

🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!

Python 5,205 545 Updated Oct 30, 2025

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 32,501 3,758 Updated Nov 2, 2025

Official codebase for GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning.

Python 26 2 Updated Nov 12, 2024

光电赛强化学习

Python 3 Updated Mar 28, 2025

强化学习 airsim场景

Python 3 Updated Apr 1, 2025

A curated list of Decision Transformer resources (continually updated)

827 35 Updated Sep 12, 2025
Next