bpesquet

Follow

Baptiste Pesquet bpesquet

Follow

Software engineer, computer science professor, AI PhD student.

722 followers · 0 following

ENS Cognitique
Bordeaux, France
https://www.bpesquet.fr

Achievements

Achievements

Highlights

Pro

Lists (2)

Sort

Decision-making

Homemade ML

20 repositories

Stars

stanford-cs336 / assignment1-basics

Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch

Python 1,971 2,212 Updated Apr 7, 2026

jeanharb / option_critic

Implementation of the Option-Critic Architecture on the Atari (ALE) environment

Python 184 56 Updated Sep 21, 2017

lweitkamp / option-critic-pytorch

PyTorch implementation of the Option-Critic framework, Harb et al. 2016

Python 145 48 Updated Aug 2, 2024

GaetanLepage / phd-manuscript

Typst 12 Updated May 14, 2026

brentyi / tyro

CLI interfaces & config objects, from types

Python 1,063 46 Updated Jun 12, 2026

ashworks1706 / rlhf-from-scratch

A theoretical and practical deep dive into Reinforcement Learning with Human Feedback and it’s applications in Large Language Models from scratch.

Jupyter Notebook 114 11 Updated Nov 7, 2025

olivkoch / ensae-dl-2026

Deep learning class at ENSAE

Jupyter Notebook 8 11 Updated Mar 12, 2026

huguettecl / MHN_minimal

Minimal implementation of Modern Hopfield Networks in PyTorch

Python 2 Updated Sep 4, 2024

MathFoundationRL / Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 16,472 1,557 Updated May 26, 2026

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 54,909 7,470 Updated May 5, 2026

ShangtongZhang / reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

Python 14,676 4,965 Updated Aug 9, 2024

anthropics / prompt-eng-interactive-tutorial

Anthropic's Interactive Prompt Engineering Tutorial

Jupyter Notebook 36,301 3,942 Updated Mar 1, 2026

Farama-Foundation / Minigrid

Simple and easily configurable grid world environments for reinforcement learning

Python 2,463 642 Updated May 30, 2026

koulanurag / ma-gym

A collection of multi agent environments based on OpenAI gym.

Python 632 114 Updated Jul 7, 2024

little-book-of / linear-algebra

A concise, beginner-friendly introduction to the core ideas of linear algebra.

Jupyter Notebook 1,977 64 Updated Mar 16, 2026

vikashplus / Adroit

ShadowHand / ADROIT MuJoCo models

78 5 Updated Feb 25, 2023

itsdfish / SequentialSamplingModels.jl

A unified interface for simulating and evaluating sequential sampling models in Julia.

Julia 35 6 Updated May 27, 2026

HandsOnLLM / Hands-On-Large-Language-Models

Official code repo for the O'Reilly Book - "Hands-On Large Language Models"

Jupyter Notebook 26,929 6,253 Updated Apr 24, 2026

cpesr / emploiESR

Données et représentations à propos de l'emploi dans l'ESR.

HTML 6 1 Updated Mar 18, 2025

google-research / tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

30,180 2,424 Updated Jun 18, 2024

suitenumerique / docs

A collaborative note taking, wiki and documentation platform that scales. Built with Django and React.

Python 16,582 596 Updated Jun 11, 2026

giant-axon / lu.i-neuron-pcb

AMPL 67 7 Updated Jun 18, 2025

rosvik / thesis-template

A template for writing your thesis with LaTeX in VS Code

Jupyter Notebook 5 1 Updated Feb 1, 2023

google / styleguide

Style guides for Google-originated open-source projects

HTML 39,360 12,970 Updated Jun 3, 2026

qihongl / pylca

A python implementation of the leaky, competing, accumulator (Usher, & McClelland, 2001).

Jupyter Notebook 3 1 Updated Mar 30, 2019

qgallouedec / panda-gym

Set of robotic environments based on PyBullet physics engine and gymnasium.

Python 757 134 Updated Jul 23, 2024

foreverska / buffalo-gym

Multi-armed Bandit Gymnasium Environment

Python 6 3 Updated Sep 6, 2025

arminbahl / random_dot_motion

A simple toolbox for displaying random dot motion stimuli

Python 4 2 Updated Jan 4, 2020

TimoFlesch / pygame_rdk

a random dot kinematogram (RDK) implemented with PyGame

Python 5 3 Updated Sep 19, 2019

vwxyzjn / cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 9,945 1,099 Updated Apr 20, 2026