SamComber

Sam Comber SamComber

AI @ Deliveroo

20 followers · 12 following

Here, there, everywhere
London
https://scholar.google.com/citations?user=KYmFMxsAAAAJ&hl=en

Achievements

x3 x3

Achievements

x3 x3

Organizations

Stars

PrimeIntellect-ai / prime-rl

Agentic RL Training at Scale

Python 1,500 315 Updated Jun 22, 2026

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 59,968 10,339 Updated Nov 12, 2025

ChenmienTan / RL2

Python 1,292 133 Updated May 20, 2026

SiliangZeng / Multi-Turn-RL-Agent

Python 129 11 Updated Jun 11, 2025

vwxyzjn / ppo-implementation-details

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

Python 940 120 Updated Mar 23, 2024

thebjorn / pydeps

Python Module Dependency graphs

Python 2,100 134 Updated Jun 19, 2026

browser-use / browser-use

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 99,969 11,143 Updated Jun 20, 2026

PrimeIntellect-ai / verifiers

Our library for RL environments + evals

Python 4,216 562 Updated Jun 21, 2026

Danau5tin / calculator_agent_rl

Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.

Python 72 7 Updated May 5, 2025

huggingface / deep-rl-class

This repo contains the Hugging Face Deep Reinforcement Learning Course.

MDX 4,929 795 Updated May 26, 2026

mpatacchiola / dissecting-reinforcement-learning

Python code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog

Python 624 178 Updated May 2, 2023

mll-lab-nu / RAGEN

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,707 225 Updated Apr 14, 2026

bytarnish / AGILE

Python 166 11 Updated Jan 21, 2025

ufal / whisper_streaming

Whisper realtime streaming for long speech-to-text transcription and translation

Python 3,642 409 Updated Nov 12, 2025

hamelsmu / llama-inference

experiments with inference on llama

Python 103 16 Updated Jun 6, 2024

adenhaus / f1-data-viz

An interactive dashboard to display Formula 1 data and statistics

Python 13 1 Updated Aug 3, 2021

guyfe / Tweetsumm

A dataset focused on summarization of dialogs, which represents the rich domain of Twitter customer care conversations

Python 32 13 Updated Dec 21, 2023

pex-tool / pex

A tool for generating .pex (Python EXecutable) files, lock files and venvs.

Python 4,219 312 Updated Jun 21, 2026

erain / bazel-python-example

Python 24 5 Updated Dec 13, 2022

pbloem / former

Simple transformer implementation from scratch in pytorch. (archival, latest version on codeberg)

Python 1,098 173 Updated Mar 20, 2025

Khaliladib11 / Transformer-from-scratch

I will build Transformer from scratch

Python 91 13 Updated Jul 21, 2025

NannyML / nannyml

nannyml: post-deployment data science in python

Python 2,142 186 Updated Jul 12, 2025

online-ml / river

🌊 Online machine learning in Python

Python 5,851 635 Updated Jun 19, 2026

WillKoehrsen / hyperparameter-optimization

Implementation of Bayesian Hyperparameter Optimization of Machine Learning Algorithms

Jupyter Notebook 641 317 Updated Apr 29, 2023

mprpic / git-spell-check

Spell checking pre-commit Git hook.

Shell 90 16 Updated Oct 5, 2019

uber / causalml

Uplift modeling and causal inference with machine learning algorithms

Python 5,880 860 Updated Jun 20, 2026

aleksandramiesiac / UpliftModelling_Iml_team4

Jupyter Notebook 3 2 Updated Jun 5, 2020

awslabs / datawig

Imputation of missing values in tables.

492 70 Updated Jan 14, 2026

awslabs / python-deequ

Python API for Deequ

Jupyter Notebook 822 155 Updated Jun 11, 2026

hyperopt / hyperopt

Distributed Asynchronous Hyperparameter Optimization in Python

Python 7,585 1,075 Updated Jun 8, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sam Comber SamComber

Achievements

Achievements

Organizations

Block or report SamComber

Stars

PrimeIntellect-ai / prime-rl

karpathy / nanoGPT

ChenmienTan / RL2

SiliangZeng / Multi-Turn-RL-Agent

vwxyzjn / ppo-implementation-details

thebjorn / pydeps

browser-use / browser-use

PrimeIntellect-ai / verifiers

Danau5tin / calculator_agent_rl

huggingface / deep-rl-class

mpatacchiola / dissecting-reinforcement-learning

mll-lab-nu / RAGEN

bytarnish / AGILE

ufal / whisper_streaming

hamelsmu / llama-inference

adenhaus / f1-data-viz

guyfe / Tweetsumm

pex-tool / pex

erain / bazel-python-example

pbloem / former

Khaliladib11 / Transformer-from-scratch

NannyML / nannyml

online-ml / river

WillKoehrsen / hyperparameter-optimization

mprpic / git-spell-check

uber / causalml

aleksandramiesiac / UpliftModelling_Iml_team4

awslabs / datawig

awslabs / python-deequ

hyperopt / hyperopt