🎮 Train NPCs using Proximal Policy Optimization in a browser-based 3D voxel environment for dynamic multi-agent reinforcement learning.
Updated Apr 12, 2026 - JavaScript
RLHF/PPO Training Pipeline with Performance Profiling and Optimization Demonstrations
Track coding progress and build a practical web development reference with examples, notes, and exercises focused on HTML, CSS, and JavaScript.
Multi-agent reinforcement learning framework for training NPCs in browser-based 3D voxel hide-and-seek using PPO and WebSocket communication between Ray RLlib and THREE.js
RLHF pipeline for Hinge bio generation: human preferences → reward model → PPO alignment