chengscott


Organizations

@tw-csie-sprout @pcshjq @pcshic @nthuion

291 26 Updated Sep 23, 2025

C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.

C++ 1,217 119 Updated Aug 12, 2024

Grammars written for ANTLR v4; the grammars are expected to be free of actions.

ANTLR 10,881 3,796 Updated Dec 17, 2025

A framework for modeling non-stationary Markov decision processes and the key decision making problems in these environments

Python 6 Updated Dec 15, 2025

A non-saturating, open-ended environment for evaluating LLMs in Factorio

Python 870 60 Updated Dec 16, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,567 2,841 Updated Dec 18, 2025

NVIDIA SASS disassembler/inline patcher

C++ 34 2 Updated Dec 17, 2025

An interface library for RL post-training with environments.

Python 848 130 Updated Dec 17, 2025

torchcomms: a modern PyTorch communications API

C++ 309 46 Updated Dec 18, 2025

Machine Learning Engineering Open Book

Python 16,058 987 Updated Dec 10, 2025

LaTeX Book/Note Writing Tutorial

TeX 750 46 Updated Dec 9, 2025

The contents of /mnt/skills in Claude's code interpreter environment

890 128 Updated Dec 12, 2025

Epistemic AlphaZero utilizes uncertainty to explore and learn even when AlphaZero gets stuck.

Python 4 1 Updated Sep 15, 2025

stdgpu: Efficient STL-like Data Structures on the GPU

C++ 1,238 93 Updated Dec 15, 2025

Really Fast End-to-End JAX RL Implementations

Python 1,003 82 Updated Sep 9, 2024
C++ 604 102 Updated Dec 18, 2025
Shell 23 3 Updated Apr 3, 2021

♟️ Vectorized RL game environments in JAX

Python 558 40 Updated Mar 6, 2025

AlphaZero based engine for the game of Go (圍棋/围棋).

C++ 114 12 Updated Dec 1, 2025

Chisel: A Modern Hardware Design Language

Scala 4,509 642 Updated Dec 17, 2025

open-source IEEE 802.11 WiFi baseband FPGA (chip) design: driver, software

C 4,448 750 Updated Dec 16, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels

C++ 4,247 347 Updated Dec 18, 2025

A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

Python 2,905 222 Updated Sep 30, 2023

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,168 120 Updated Nov 9, 2025

Self-hosted AI coding assistant

Rust 32,605 1,658 Updated Dec 15, 2025

Tensor library for machine learning

C++ 13,726 1,429 Updated Dec 17, 2025

An open-source, efficient deep learning framework/compiler, written in Python.

Python 737 68 Updated Sep 4, 2025

OpTree: Optimized PyTree Utilities

Python 202 12 Updated Dec 17, 2025

Source code for Self-Evaluation Guided MCTS for online DPO.

Jupyter Notebook 326 37 Updated Aug 6, 2024

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 1,568 128 Updated Nov 24, 2025