Skip to content
View bpesquet's full-sized avatar

Highlights

  • Pro

Block or report bpesquet

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch

Python 1,971 2,212 Updated Apr 7, 2026

Implementation of the Option-Critic Architecture on the Atari (ALE) environment

Python 184 56 Updated Sep 21, 2017

PyTorch implementation of the Option-Critic framework, Harb et al. 2016

Python 145 48 Updated Aug 2, 2024
Typst 12 Updated May 14, 2026

CLI interfaces & config objects, from types

Python 1,063 46 Updated Jun 12, 2026

A theoretical and practical deep dive into Reinforcement Learning with Human Feedback and it’s applications in Large Language Models from scratch.

Jupyter Notebook 114 11 Updated Nov 7, 2025

Deep learning class at ENSAE

Jupyter Notebook 8 11 Updated Mar 12, 2026

Minimal implementation of Modern Hopfield Networks in PyTorch

Python 2 Updated Sep 4, 2024

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 16,472 1,557 Updated May 26, 2026

The best ChatGPT that $100 can buy.

Python 54,909 7,470 Updated May 5, 2026

Python Implementation of Reinforcement Learning: An Introduction

Python 14,676 4,965 Updated Aug 9, 2024

Anthropic's Interactive Prompt Engineering Tutorial

Jupyter Notebook 36,301 3,942 Updated Mar 1, 2026

Simple and easily configurable grid world environments for reinforcement learning

Python 2,463 642 Updated May 30, 2026

A collection of multi agent environments based on OpenAI gym.

Python 632 114 Updated Jul 7, 2024

A concise, beginner-friendly introduction to the core ideas of linear algebra.

Jupyter Notebook 1,977 64 Updated Mar 16, 2026

ShadowHand / ADROIT MuJoCo models

78 5 Updated Feb 25, 2023

A unified interface for simulating and evaluating sequential sampling models in Julia.

Julia 35 6 Updated May 27, 2026

Official code repo for the O'Reilly Book - "Hands-On Large Language Models"

Jupyter Notebook 26,929 6,253 Updated Apr 24, 2026

Données et représentations à propos de l'emploi dans l'ESR.

HTML 6 1 Updated Mar 18, 2025

A playbook for systematically maximizing the performance of deep learning models.

30,180 2,424 Updated Jun 18, 2024

A collaborative note taking, wiki and documentation platform that scales. Built with Django and React.

Python 16,582 596 Updated Jun 11, 2026

A template for writing your thesis with LaTeX in VS Code

Jupyter Notebook 5 1 Updated Feb 1, 2023

Style guides for Google-originated open-source projects

HTML 39,360 12,970 Updated Jun 3, 2026

A python implementation of the leaky, competing, accumulator (Usher, & McClelland, 2001).

Jupyter Notebook 3 1 Updated Mar 30, 2019

Set of robotic environments based on PyBullet physics engine and gymnasium.

Python 757 134 Updated Jul 23, 2024

Multi-armed Bandit Gymnasium Environment

Python 6 3 Updated Sep 6, 2025

A simple toolbox for displaying random dot motion stimuli

Python 4 2 Updated Jan 4, 2020

a random dot kinematogram (RDK) implemented with PyGame

Python 5 3 Updated Sep 19, 2019

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 9,945 1,099 Updated Apr 20, 2026
Next