roger-creus/README.md

Hi 👋, I'm Roger!

PhD Student in AI at Université de Montréal and Mila Quebec AI Institute

Supervised by Professors Pablo Samuel Castro and Glen Berseth

  • 🔬 Building general autonomous agents with Reinforcement Learning and Foundation Models.
  • 💻 I love building RL infrastructure — I'm a core contributor to CleanRL and RLLTE, and have developed frameworks like RLeXplore and others for my papers.
  • 📂 Check out my pinned repositories for open-source projects and research code!
  • 🌐 Personal webpage • 📫 roger[dot]creus[dash]castanyer[at]mila[dot]quebec

Pinned

  1. agentick (Public)

    Universal benchmark for evaluating AI agents 🚀

    Python, 10 stars

  2. vwxyzjn/cleanrl (Public)

    High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

    Python, 10k stars, 1.1k forks

  3. stable-deep-rl-at-scale (Public)

    Code for the paper "Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning". Great performance in many environments!

    Python, 39 stars, 2 forks

  4. RLE-Foundation/RLeXplore (Public)

    RLeXplore provides stable baselines of exploration methods in reinforcement learning, such as intrinsic curiosity module (ICM), random network distillation (RND) and rewarding impact-driven explora…

    Jupyter Notebook, 465 stars, 23 forks

  5. xgenius (Public)

    Autonomous research at scale with Claude and SLURM 🚀

    Python, 13 stars, 2 forks