PPO, DDPG, SAC implementation on mujoco environment
-
Updated
Feb 16, 2022 - Python
PPO, DDPG, SAC implementation on mujoco environment
The repository is intended as a support tool for the report of the project "Sim to Real transfer of Reinforcement Learning Policies in Robotics" and it contains examples of some well-known algorithms and methods in the fields of Reinforcement Learning and Sim-to-Real transfer. The implementation is not thought to be efficient, thus we suggest yo…
Cross-platform FlashAttention-2 Triton implementation for Turing+ with custom configuration mode
Scripts for Hopper Disassembler
Code from "How useful is quantilization for mitigating specification-gaming?"
Repository associated with the publication ``Materials Matter: Investigating Functional Advantages of Bio-Inspired Materials via Simulated Robotic Hopping''
Project of MLDL2024 for Reinforcement Learning s328834, s328964, s328830
Implementation of T-REX and D-REX Inverse Reinforcement Learning (IRL) algorithm for learning form suboptimal demonstrations
My Hopper Disassembler scripts.
LLaMA-Factory FP8 training environment for NVIDIA Hopper GPUs. Fixes common configuration issues causing 2x slowdown with FP8 mixed precision.
some experiments with training and fine-tuning decision transformer
Add a description, image, and links to the hopper topic page so that developers can more easily learn about it.
To associate your repository with the hopper topic, visit your repo's landing page and select "manage topics."