🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation

JavaScript 4,899 486 Updated Sep 9, 2024

seolhokim / SimpleDistributedRL

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

Python 7 1 Updated Apr 11, 2024

autonomousvision / sledge

[ECCV'24] SLEDGE: Synthesizing Driving Environments with Generative Models and Rule-Based Traffic

Python 205 11 Updated Jul 14, 2025

opendilab / awesome-exploration-rl

A curated list of awesome exploration RL resources (continually updated)

607 21 Updated Dec 2, 2025

seohongpark / METRA

METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)

Python 81 8 Updated Oct 15, 2023

openai / safety-starter-agents

Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.

Python 452 112 Updated Apr 2, 2023

mengdi-li / awesome-RLAIF

A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)

193 5 Updated Aug 6, 2025

understanding-search / maze-transformer

This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks.

Jupyter Notebook 32 7 Updated Oct 28, 2025

amacati / SoulsGym

Gymnasium extension for DarkSouls III, Elden Ring, and other Souls games

Python 141 14 Updated Oct 20, 2024

corl-team / CORL

Forked from tinkoff-ai/CORL

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 612 38 Updated Feb 10, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

seolhokim seolhokim

Achievements

Achievements

Block or report seolhokim

Stars

mementum / backtrader

nari-labs / dia

anhnh2002 / XTTSv2-Finetuning-for-New-Languages

choosewhatulike / trainable-agents

allenai / open-instruct

imoneoi / openchat

mmz-001 / knowledge_gpt

Automattic / harper

sarthakrastogi / quality-prompts

KellerJordan / modded-nanogpt

eloialonso / diamond

jlin816 / dynalang

OpenBMB / AgentVerse

seolhokim / SimpleDistributedRL

autonomousvision / sledge

opendilab / awesome-exploration-rl

seohongpark / METRA

openai / safety-starter-agents

mengdi-li / awesome-RLAIF

understanding-search / maze-transformer

amacati / SoulsGym

corl-team / CORL

HobbitLong / PyContrast

opendilab / LightZero

google-deepmind / alphadev

IvLabs / ResearchPaperNotes

IvLabs / resources

metadriverse / metadrive

ClementPerroud / Gym-Trading-Env

YongfeiYan / Gumbel_Softmax_VAE