chengscott

Scott Cheng chengscott

78 followers · 102 following

Open to Work
PhD candidate @ Penn State CSE
chengscott.io

Achievements

Highlights

Organizations

Stars

NVlabs / NVBit

291 26 Updated Sep 23, 2025

sail-sg / envpool

C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.

C++ 1,217 119 Updated Aug 12, 2024

antlr / grammars-v4

Grammars written for ANTLR v4; expectation that the grammars are free of actions.

ANTLR 10,881 3,796 Updated Dec 17, 2025

scope-lab-vu / ns_gym

A framework for modeling non-stationary Markov decision processes and the key decision making problems in these environments

Python 6 Updated Dec 15, 2025

JackHopkins / factorio-learning-environment

A non-saturating, open-ended environment for evaluating LLMs in Factorio

Python 870 60 Updated Dec 16, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,567 2,841 Updated Dec 18, 2025

redplait / denvdis

NVidia sass disassembler/inline patcher

C++ 34 2 Updated Dec 17, 2025

meta-pytorch / OpenEnv

An interface library for RL post training with environments.

Python 848 130 Updated Dec 17, 2025

meta-pytorch / torchcomms

torchcomms: a modern PyTorch communications API

C++ 309 46 Updated Dec 18, 2025

stas00 / ml-engineering

Machine Learning Engineering Open Book

Python 16,058 987 Updated Dec 10, 2025

BenjaminGor / Latex_Notes_Tutorial

Latex Book/Note Writing Tutorial

TeX 750 46 Updated Dec 9, 2025

simonw / claude-skills

The contents of /mnt/skills in Claude's code interpreter environment

890 128 Updated Dec 12, 2025

emcts / e-alphazero

Epistemic AlphaZero utilizes uncertainty to explore and learn even when AlphaZero gets stuck.

Python 4 1 Updated Sep 15, 2025

stotko / stdgpu

stdgpu: Efficient STL-like Data Structures on the GPU

C++ 1,238 93 Updated Dec 15, 2025

luchris429 / purejaxrl

Really Fast End-to-End Jax RL Implementations

Python 1,003 82 Updated Sep 9, 2024

NVIDIA / cuCollections

C++ 604 102 Updated Dec 18, 2025

Qwertylex / samdecrypt.sh

Shell 23 3 Updated Apr 3, 2021

sotetsuk / pgx

♟️ Vectorized RL game environments in JAX

Python 558 40 Updated Mar 6, 2025

CGLemon / Sayuri

AlphaZero based engine for the game of Go (圍棋/围棋).

C++ 114 12 Updated Dec 1, 2025

chipsalliance / chisel

Chisel: A Modern Hardware Design Language

Scala 4,509 642 Updated Dec 17, 2025

open-sdr / openwifi

open-source IEEE 802.11 WiFi baseband FPGA (chip) design: driver, software

C 4,448 750 Updated Dec 16, 2025

tile-ai / tilelang

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 4,247 347 Updated Dec 18, 2025

turboderp / exllama

A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

Python 2,905 222 Updated Sep 30, 2023

TsinghuaC3I / Awesome-RL-for-LRMs

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,168 120 Updated Nov 9, 2025

TabbyML / tabby

Self-hosted AI coding assistant

Rust 32,605 1,658 Updated Dec 15, 2025

ggml-org / ggml

Tensor library for machine learning

C++ 13,726 1,429 Updated Dec 17, 2025

hidet-org / hidet

An open-source efficient deep learning framework/compiler, written in python.

Python 737 68 Updated Sep 4, 2025

metaopt / optree

OpTree: Optimized PyTree Utilities

Python 202 12 Updated Dec 17, 2025

YuxiXie / MCTS-DPO

This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.

Jupyter Notebook 326 37 Updated Aug 6, 2024

PKU-Alignment / safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 1,568 128 Updated Nov 24, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Scott Cheng chengscott

Achievements

Achievements

Highlights

Organizations

Block or report chengscott

Stars

NVlabs / NVBit

sail-sg / envpool

antlr / grammars-v4

scope-lab-vu / ns_gym

JackHopkins / factorio-learning-environment

volcengine / verl

redplait / denvdis

meta-pytorch / OpenEnv

meta-pytorch / torchcomms

stas00 / ml-engineering

BenjaminGor / Latex_Notes_Tutorial

simonw / claude-skills

emcts / e-alphazero

stotko / stdgpu

luchris429 / purejaxrl

NVIDIA / cuCollections

Qwertylex / samdecrypt.sh

sotetsuk / pgx

CGLemon / Sayuri

chipsalliance / chisel

open-sdr / openwifi

tile-ai / tilelang

turboderp / exllama

TsinghuaC3I / Awesome-RL-for-LRMs

TabbyML / tabby

ggml-org / ggml

hidet-org / hidet

metaopt / optree

YuxiXie / MCTS-DPO

PKU-Alignment / safe-rlhf