JordanAsh

badge Public

An implementation of the BADGE batch active learning algorithm.

Python 217 37
warm_start Public

Code corresponding to 'On Warm-Starting Neural Network Training'

Python 9 1
boostresnet Public

A PyTorch implementation of BoostResNet

Python 5 3
acb Public

A PyTorch implementation of the Anti-concentrated Confidence Bonus (ACB) for promoting exploration in deep reinforcement learning.

Python 2
jordanash.github.io Public

HTML
plasticity-rl-experiments Public

Plasticity loss in RL post-training: GRPO vs SFT on Qwen2.5-1.5B (GSM8K + MATH)

Python