Skip to content
View bhyang's full-sized avatar

Block or report bhyang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluation, and experimentation.

Rust 10,724 746 Updated Dec 26, 2025

Simplifying reinforcement learning for complex game environments

C 4,652 349 Updated Dec 19, 2025

DSPy: The framework for programming—not prompting—language models

Python 31,049 2,501 Updated Dec 23, 2025

Really Fast End-to-End Jax RL Implementations

Python 1,006 82 Updated Sep 9, 2024

[arXiv] What-If Motion Prediction for Autonomous Driving ❓🚗💨

Python 123 22 Updated Dec 1, 2021

[CoRL'22] PlanT: Explainable Planning Transformers via Object-Level Representations

Python 289 38 Updated Nov 25, 2025
Python 118 29 Updated Jul 31, 2025

A data-driven, fast driving simulator for multi-agent coordination under partial observability.

Python 291 32 Updated Jun 18, 2024

Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]

Python 41 6 Updated Aug 27, 2022

Pytorch version of Dreamer, which follows the original TF v2 codes.

Python 139 24 Updated Feb 7, 2022

Pytorch implementation of Dreamer-v2: Visual Model Based RL Algorithm.

Python 272 49 Updated Jul 29, 2023

Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.

Python 918 142 Updated Dec 20, 2023

MuZero

Python 2,744 665 Updated Sep 3, 2024

PyTorch Implementation of the Maximum a Posteriori Policy Optimisation

Python 78 20 Updated Nov 19, 2022

Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238

Python 48 6 Updated Nov 10, 2020

(ICCV 2021, Oral) RL and distillation in CARLA using a factorized world model

Python 183 31 Updated Feb 17, 2022

Official GitHub repository for Argoverse dataset

Python 925 257 Updated Dec 15, 2023

Driving in CARLA using model-free deep reinforcement learning

Python 61 17 Updated Feb 2, 2021

Learning Invariant Representations for Reinforcement Learning without Reconstruction

Python 155 38 Updated Aug 31, 2021

Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

Python 30,625 3,631 Updated Dec 22, 2025

An offline deep reinforcement learning library

Python 1,613 261 Updated Sep 10, 2025

The "Python Machine Learning (2nd edition)" book code repository and info resource

Jupyter Notebook 7,194 2,819 Updated Oct 1, 2020
Python 2 2 Updated Mar 19, 2019

a contextual, biasable, word-or-sentence-or-paragraph extractive summarizer powered by the latest in text embeddings (Bert, Universal Sentence Encoder, Flair)

Python 229 26 Updated Dec 27, 2022

Non-parametric Entropy Estimation Toolbox

Python 422 94 Updated Oct 5, 2022
Python 14 8 Updated Aug 7, 2019

🎯 A comprehensive gradient-free optimization framework written in Python

Python 580 59 Updated Jul 19, 2019

Bayesian optimization

Jupyter Notebook 38 13 Updated Dec 3, 2019
Next