Organizations

@doordash @creditornot @deliveroo @GDSL-UL

Agentic RL Training at Scale

Python 1,493 314 Updated Jun 19, 2026

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 59,868 10,327 Updated Nov 12, 2025
Python 1,292 133 Updated May 20, 2026

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

Python 941 120 Updated Mar 23, 2024

Python Module Dependency graphs

Python 2,099 134 Updated Jun 16, 2026

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 99,567 11,104 Updated Jun 15, 2026

Our library for RL environments + evals

Python 4,206 561 Updated Jun 19, 2026

Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a 62% absolute increase in evaluation accuracy.

Python 72 7 Updated May 5, 2025

This repo contains the Hugging Face Deep Reinforcement Learning Course.

MDX 4,926 794 Updated May 26, 2026

Python code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog

Python 624 178 Updated May 2, 2023

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,707 225 Updated Apr 14, 2026
Python 166 11 Updated Jan 21, 2025

Whisper realtime streaming for long speech-to-text transcription and translation

Python 3,641 409 Updated Nov 12, 2025

experiments with inference on llama

Python 103 16 Updated Jun 6, 2024

An interactive dashboard to display Formula 1 data and statistics

Python 13 1 Updated Aug 3, 2021

A dataset focused on summarization of dialogs, which represents the rich domain of Twitter customer care conversations

Python 32 13 Updated Dec 21, 2023

A tool for generating .pex (Python EXecutable) files, lock files and venvs.

Python 4,218 312 Updated Jun 19, 2026
Python 24 5 Updated Dec 13, 2022

Simple transformer implementation from scratch in pytorch. (archival, latest version on codeberg)

Python 1,098 173 Updated Mar 20, 2025

I will build Transformer from scratch

Python 91 13 Updated Jul 21, 2025

nannyml: post-deployment data science in python

Python 2,142 186 Updated Jul 12, 2025

🌊 Online machine learning in Python

Python 5,847 632 Updated Jun 19, 2026

Implementation of Bayesian Hyperparameter Optimization of Machine Learning Algorithms

Jupyter Notebook 641 317 Updated Apr 29, 2023

Spell checking pre-commit Git hook.

Shell 90 16 Updated Oct 5, 2019

Uplift modeling and causal inference with machine learning algorithms

Python 5,876 859 Updated Jun 17, 2026

Imputation of missing values in tables.

492 70 Updated Jan 14, 2026

Python API for Deequ

Jupyter Notebook 822 155 Updated Jun 11, 2026

Distributed Asynchronous Hyperparameter Optimization in Python

Python 7,584 1,074 Updated Jun 8, 2026