bhyang

Brian Yang bhyang

byang.org

Achievements

Stars

tensorzero / tensorzero

TensorZero is an open-source stack for industrial-grade LLM applications. It unifies an LLM gateway, observability, optimization, evaluation, and experimentation.

Rust 10,724 746 Updated Dec 26, 2025

PufferAI / PufferLib

Simplifying reinforcement learning for complex game environments

C 4,652 349 Updated Dec 19, 2025

RoboStack / robostack.github.io

Python 315 29 Updated Dec 26, 2025

stanfordnlp / dspy

DSPy: The framework for programming—not prompting—language models

Python 31,049 2,501 Updated Dec 23, 2025

luchris429 / purejaxrl

Really Fast End-to-End Jax RL Implementations

Python 1,006 82 Updated Sep 9, 2024

Zhendong-Wang / Diffusion-Policies-for-Offline-RL

Python 413 46 Updated Apr 29, 2024

wqi / WIMP

[arXiv] What-If Motion Prediction for Autonomous Driving ❓🚗💨

Python 123 22 Updated Dec 1, 2021

autonomousvision / plant

[CoRL'22] PlanT: Explainable Planning Transformers via Object-Level Representations

Python 289 38 Updated Nov 25, 2025

roggirg / AutoBots

Python 118 29 Updated Jul 31, 2025

facebookresearch / nocturne

A data-driven, fast driving simulator for multi-agent coordination under partial observability.

Python 291 32 Updated Jun 18, 2024

hari-sikchi / LOOP

Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]

Python 41 6 Updated Aug 27, 2022

jsikyoon / dreamer-torch

Pytorch version of Dreamer, which follows the original TF v2 codes.

Python 139 24 Updated Feb 7, 2022

RajGhugare19 / dreamerv2

Pytorch implementation of Dreamer-v2: Visual Model Based RL Algorithm.

Python 272 49 Updated Jul 29, 2023

YeWR / EfficientZero

Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.

Python 918 142 Updated Dec 20, 2023

werner-duvaud / muzero-general

MuZero

Python 2,744 665 Updated Sep 3, 2024

daisatojp / mpo

PyTorch Implementation of the Maximum a Posteriori Policy Optimisation

Python 78 20 Updated Nov 19, 2022

YYCAAA / V-MPO_Lunarlander

Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238

Python 48 6 Updated Nov 10, 2020

dotchen / WorldOnRails

(ICCV 2021, Oral) RL and distillation in CARLA using a factorized world model

Python 183 31 Updated Feb 17, 2022

argoverse / argoverse-api

Official GitHub repository for Argoverse dataset

Python 925 257 Updated Dec 15, 2023

valeoai / LearningByCheating

Forked from dotchen/LearningByCheating

Driving in CARLA using model-free deep reinforcement learning

Python 61 17 Updated Feb 2, 2021

facebookresearch / deep_bisim4control

Learning Invariant Representations for Reinforcement Learning without Reconstruction

Python 155 38 Updated Aug 31, 2021

Lightning-AI / pytorch-lightning

Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

Python 30,625 3,631 Updated Dec 22, 2025

takuseno / d3rlpy

An offline deep reinforcement learning library

Python 1,613 261 Updated Sep 10, 2025

rasbt / python-machine-learning-book-2nd-edition

The "Python Machine Learning (2nd edition)" book code repository and info resource

Jupyter Notebook 7,194 2,819 Updated Oct 1, 2020

chrisdxie / reminiscent_tracker

Python 2 2 Updated Mar 19, 2019

Hellisotherpeople / CX_DB8

a contextual, biasable, word-or-sentence-or-paragraph extractive summarizer powered by the latest in text embeddings (Bert, Universal Sentence Encoder, Flair)

Python 229 26 Updated Dec 27, 2022

gregversteeg / NPEET

Non-parametric Entropy Estimation Toolbox

Python 422 94 Updated Oct 5, 2022

gkahn13 / arxiv-filter

Python 14 8 Updated Aug 7, 2019

100 / Solid

🎯 A comprehensive gradient-free optimization framework written in Python

Python 580 59 Updated Jul 19, 2019

jmetzen / bayesian_optimization

Bayesian optimization

Jupyter Notebook 38 13 Updated Dec 3, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Brian Yang bhyang

Achievements

Achievements

Block or report bhyang

Stars

tensorzero / tensorzero

PufferAI / PufferLib

RoboStack / robostack.github.io

stanfordnlp / dspy

luchris429 / purejaxrl

Zhendong-Wang / Diffusion-Policies-for-Offline-RL

wqi / WIMP

autonomousvision / plant

roggirg / AutoBots

facebookresearch / nocturne

hari-sikchi / LOOP

jsikyoon / dreamer-torch

RajGhugare19 / dreamerv2

YeWR / EfficientZero

werner-duvaud / muzero-general

daisatojp / mpo

YYCAAA / V-MPO_Lunarlander

dotchen / WorldOnRails

argoverse / argoverse-api

valeoai / LearningByCheating

facebookresearch / deep_bisim4control

Lightning-AI / pytorch-lightning

takuseno / d3rlpy

rasbt / python-machine-learning-book-2nd-edition

chrisdxie / reminiscent_tracker

Hellisotherpeople / CX_DB8

gregversteeg / NPEET

gkahn13 / arxiv-filter

100 / Solid

jmetzen / bayesian_optimization