Skip to content
View SamComber's full-sized avatar

Organizations

@GDSL-UL

Block or report SamComber

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Async RL Training at Scale

Python 1,085 209 Updated Feb 18, 2026

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 53,400 9,039 Updated Nov 12, 2025
Python 1,132 116 Updated Jan 19, 2026

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

Python 917 122 Updated Mar 23, 2024

Python Module Dependency graphs

Python 2,054 132 Updated Jan 6, 2026

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 78,508 9,290 Updated Feb 18, 2026

Our library for RL environments + evals

Python 3,842 494 Updated Feb 18, 2026

Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.

Python 65 7 Updated May 5, 2025

This repo contains the Hugging Face Deep Reinforcement Learning Course.

MDX 4,765 775 Updated Dec 28, 2025

Python code, PDFs and resources for the series of posts on Reinforcement Learning which I published on my personal blog

Python 624 179 Updated May 2, 2023

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,513 206 Updated Feb 18, 2026
Python 164 11 Updated Jan 21, 2025

Whisper realtime streaming for long speech-to-text transcription and translation

Python 3,537 413 Updated Nov 12, 2025

experiments with inference on llama

Python 103 16 Updated Jun 6, 2024

An interactive dashboard to display Formula 1 data and statistics

Python 13 1 Updated Aug 3, 2021

A dataset focused on summarization of dialogs, which represents the rich domain of Twitter customer care conversations

Python 32 13 Updated Dec 21, 2023

A tool for generating .pex (Python EXecutable) files, lock files and venvs.

Python 4,183 309 Updated Feb 17, 2026
Python 24 5 Updated Dec 13, 2022

Simple transformer implementation from scratch in pytorch. (archival, latest version on codeberg)

Python 1,095 172 Updated Mar 20, 2025

I will build Transformer from scratch

Python 85 12 Updated Jul 21, 2025

nannyml: post-deployment data science in python

Python 2,122 177 Updated Jul 12, 2025

🌊 Online machine learning in Python

Python 5,717 606 Updated Feb 9, 2026

Implementation of Bayesian Hyperparameter Optimization of Machine Learning Algorithms

Jupyter Notebook 639 322 Updated Apr 29, 2023

Spell checking pre-commit Git hook.

Shell 90 16 Updated Oct 5, 2019

Uplift modeling and causal inference with machine learning algorithms

Python 5,740 852 Updated Feb 15, 2026

Imputation of missing values in tables.

493 70 Updated Jan 14, 2026

Python API for Deequ

Jupyter Notebook 813 149 Updated Jan 21, 2026

Distributed Asynchronous Hyperparameter Optimization in Python

Python 7,619 1,077 Updated Feb 8, 2026
Next