I am a Senior Machine Learning Software Engineer.
Focused on reinforcement learning, AI infrastructure, and building reliable and scalable software for AI systems.
- gym-puddle: Off-policy PAC algorithm implemented on the Puddle World Gymnasium environment using TorchRL
- proprio: Unsupervised, uncertainty-aware perception for a 7-DOF robot arm; classifies each lidar reading as self, background, or anomaly, without any geometry or kinematics.
- AlphaEx: Sweep parameters and dispatch thousands of Slurm jobs from one Python script
- internals: Interactive, first-principles tutorials for modern AI systems & system components.
- Speculative Decoding: Interactive walkthrough of how LLMs emit several tokens per forward pass; same output, way fewer passes.
- nabla: Educational numpy implementations of 15 optimizers (SGD → Muon), animated on a 2D saddle & benchmarked on matrix LS.