dantp-ai

Follow

Daniel Plop dantp-ai

Follow

16 followers · 4 following

Achievements

Achievements

Organizations

dantp-ai/README.md

Howdy, I'm Daniel 👋.

I am a Senior Machine Learning Software Engineer.

Focused on reinforcement learning, AI infrastructure, and building reliable and scalable software for AI systems.

Projects

Reinforcement Learning & Robotics

gym-puddle: Off-policy PAC algorithm implemented on the Puddle World Gymnasium environment using TorchRL
proprio: Unsupervised, uncertainty-aware perception for a 7-DOF robot arm; classifies each lidar reading as self, background, or anomaly, without any geometry or kinematics.

Tooling

AlphaEx: Sweep parameters and dispatch thousands of Slurm jobs from one Python script

Educational

internals: Interactive, first-principles tutorials for modern AI systems & system components.
- Speculative Decoding: Interactive walkthrough of how LLMs emit several tokens per forward pass; same output, way fewer passes.
nabla: Educational numpy implementations of 15 optimizers (SGD → Muon), animated on a 2D saddle & benchmarked on matrix LS.

GitHub Activity

Latest Blog Posts

Review on the Technical Report: Gemini Robotics 1.5

Pinned Loading

tianshou tianshou Public

Forked from thu-ml/tianshou

An elegant PyTorch deep reinforcement learning library.

Python
clawloop clawloop Public

Forked from aganthos/clawloop

Make your agents learn from experience. One protocol for weights, harness and routing.

Python
deep-rl-algos-methods deep-rl-algos-methods Public

Jupyter Notebook
minitorch minitorch Public template

Forked from minitorch/minitorch

The full minitorch student suite.

Python
proprio proprio Public

Forked from georgosgeorgos/DLRC_2018

Statistical Models for Robotic Perception

Jupyter Notebook
gym-puddle gym-puddle Public

Forked from EhsanEI/gym-puddle

Continuous grid-world environment for RL using Gymnasium

Python