Yijia-Chen

Yijia Chen Yijia-Chen

chief meditator @curio-research

87 followers · 57 following

Achievements

x3 x2

Achievements

x3 x2

Highlights

Organizations

Stars

karpathy / autoresearch

AI agents running research on single-GPU nanochat training automatically

Python 68,726 9,955 Updated Mar 26, 2026

gnboorse / centipede

Constraint Satisfaction Problem Solver for Golang

Go 75 11 Updated Jul 11, 2022

radarlabs / unity-radar

Unity SDK for Radar, the leading geofencing and location tracking platform

C# 1 Updated Nov 3, 2025

tzafon / Tzafon-WayPoint

Tzafon-WayPoint is a robust, scalable solution for managing large fleets of browser instances. WayPoint stands out with unmatched cold‑start speed—launching up to a 1000 browser per second on stand…

Rust 85 10 Updated Apr 22, 2025

clockworklabs / SpacetimeDB

Development at the speed of light

Rust 24,418 976 Updated Apr 8, 2026

SeismicSystems / seismic-reth

Execution client for Seismic

Rust 144 63 Updated Apr 8, 2026

openai / baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Python 16,689 4,946 Updated Aug 1, 2024

Khrylx / PyTorch-RL

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Python 1,277 191 Updated Feb 9, 2021

Farama-Foundation / Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 11,663 1,314 Updated Mar 28, 2026

eloialonso / diamond

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 2,009 149 Updated Dec 6, 2024

google-deepmind / acme

A library of reinforcement learning components and agents

Python 3,958 534 Updated Apr 8, 2026

GameGen-X / GameGen-X

329 19 Updated May 22, 2025

etched-ai / open-oasis

Inference script for Oasis 500M

Python 2,065 176 Updated Nov 8, 2024

salesforce / ai-economist

Foundation is a flexible, modular, and composable framework to model socio-economic behaviors and dynamics with both agents and governments. This framework can be used in conjunction with reinforce…

Python 106 28 Updated Aug 20, 2023

Farama-Foundation / PettingZoo

An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities

Python 3,364 482 Updated Feb 6, 2026

pytorch / pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 98,928 27,436 Updated Apr 8, 2026

higgsfield / RL-Adventure

Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL

Jupyter Notebook 3,171 594 Updated Nov 4, 2021

Tencent / behaviac

behaviac is a framework of the game AI development, and it also can be used as a rapid game prototype design tool. behaviac supports the behavior tree, finite state machine and hierarchical task ne…

C# 3,033 814 Updated Jul 7, 2023