Skip to content
View seolhokim's full-sized avatar
🌴
On vacation
🌴
On vacation

Block or report seolhokim

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Python Backtesting library for trading strategies

Python 19,838 4,829 Updated Aug 19, 2024

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 18,982 1,652 Updated Nov 19, 2025

Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"

Python 603 44 Updated Oct 29, 2024

AllenAI's post-training codebase

Python 3,470 477 Updated Dec 23, 2025

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,464 434 Updated Sep 13, 2024

Accurate answers and instant citations for your documents.

Python 1,657 804 Updated May 29, 2024

Offline, privacy-first grammar checker. Fast, open-source, Rust-powered

Rust 8,847 239 Updated Dec 22, 2025

NanoGPT (124M) in 3 minutes

Python 4,001 528 Updated Dec 22, 2025

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 1,936 139 Updated Dec 6, 2024

Code for "Learning to Model the World with Language." ICML 2024 Oral.

Python 414 32 Updated Sep 21, 2023

🤖 AgentVerse 🪐 is designed to facilitate the deployment of multiple LLM-based agents in various applications, which primarily provides two frameworks: task-solving and simulation

JavaScript 4,899 486 Updated Sep 9, 2024

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

Python 7 1 Updated Apr 11, 2024

[ECCV'24] SLEDGE: Synthesizing Driving Environments with Generative Models and Rule-Based Traffic

Python 205 11 Updated Jul 14, 2025

A curated list of awesome exploration RL resources (continually updated)

607 21 Updated Dec 2, 2025

METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)

Python 81 8 Updated Oct 15, 2023

Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.

Python 452 112 Updated Apr 2, 2023

A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)

193 5 Updated Aug 6, 2025

This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks.

Jupyter Notebook 32 7 Updated Oct 28, 2025

Gymnasium extension for DarkSouls III, Elden Ring, and other Souls games

Python 141 14 Updated Oct 20, 2024

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 612 38 Updated Feb 10, 2024

PyTorch implementation of Contrastive Learning methods

Python 1,996 184 Updated Oct 4, 2023

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python 1,487 183 Updated Dec 19, 2025
Python 724 77 Updated Jun 20, 2023

Initiative to read research papers

180 37 Updated Dec 30, 2023

Resources on various topics being worked on at IvLabs

350 63 Updated Nov 17, 2023

MetaDrive: Lightweight driving simulator for everyone

Python 1,070 170 Updated Aug 15, 2025

A simple, easy, customizable Gymnasium environment for trading.

Python 467 100 Updated Mar 19, 2025

PyTorch implementation of a Variational Autoencoder with Gumbel-Softmax Distribution

Python 214 35 Updated Sep 5, 2018
Next