A library of reinforcement learning components and agents

Python 4,005 541 Updated Apr 8, 2026

Using multiple sensor modalities to improve exploration for robotic manipulation tasks with sparse rewards

Shell 10 6 Updated Sep 17, 2019

Code for the paper "Quantifying Transfer in Reinforcement Learning"

C++ 408 88 Updated Oct 7, 2023

LaTeX code for making neural network diagrams

TeX 24,831 3,069 Updated Aug 21, 2023

🚗 Rocket League Distributed Deep Reinforcement Learning Bot

Python 157 25 Updated May 10, 2019

LabNotebook is a tool that allows you to flexibly monitor, record, save, and query all your machine learning experiments.

Jupyter Notebook 528 38 Updated Mar 31, 2018

Collection of reinforcement learning algorithms

Python 2,909 571 Updated Jun 17, 2024

Multitask Environments for RL

Python 283 65 Updated Aug 23, 2021

For educational materials related to the spinning up workshops.

TeX 206 47 Updated Feb 12, 2019

Repo for reproduction of sequential social dilemmas

Python 417 136 Updated Mar 6, 2025

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 35,879 3,651 Updated Jun 23, 2026

An educational resource to help anyone learn deep reinforcement learning.

Python 11,824 2,454 Updated Aug 5, 2024

Google Research

Jupyter Notebook 38,209 8,441 Updated Jun 22, 2026

A Python toolbox for performing gradient-free optimization

Python 4,196 369 Updated Mar 16, 2026

Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

Python 2,769 819 Updated Apr 9, 2024

Multi Agent Reinforcement Learning using MalmÖ

Python 266 44 Updated Apr 14, 2020

Variance Networks: When Expectation Does Not Meet Your Expectations, ICLR 2019

Python 39 2 Updated Jan 31, 2020

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Python 22,955 3,599 Updated Jul 28, 2024

the only cheat sheet you need

Python 41,476 1,915 Updated Dec 23, 2025

Code for paper "Which Training Methods for GANs do actually Converge? (ICML 2018)"

Jupyter Notebook 920 114 Updated Aug 27, 2019

Ordered Preference Elicitation Strategies for Multi-Objective Decision Making using Gaussian Processes

Python 23 6 Updated Jul 25, 2018

Code for the paper "Evolution Strategies as a Scalable Alternative to Reinforcement Learning"

Python 1,633 280 Updated Oct 31, 2019

🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorithms for single-player (UCB, KL-UCB, Thompson...) and multi-play…

Jupyter Notebook 423 62 Updated Jun 19, 2026

An official TensorFlow implementation of "Neural Program Synthesis from Diverse Demonstration Videos" (ICML 2018) by Shao-Hua Sun, Hyeonwoo Noh, Sriram Somasundaram, and Joseph J. Lim

Python 103 23 Updated Mar 24, 2023

Fine Tuning Language Models for Multilabel Prediction

Python 61 10 Updated Oct 30, 2022

Cluttered Omniglot dataset and models

Python 48 11 Updated May 3, 2019

Implementation of Conditionally Shifted Neurons by Munkhdalai et al. (https://arxiv.org/pdf/1712.09926.pdf)

Python 28 3 Updated Jul 8, 2018

This is the repository for the distill web framework

JavaScript 982 162 Updated Dec 5, 2022

Reference models and tools for Cloud TPUs.

Jupyter Notebook 5,281 1,754 Updated Jun 22, 2026

Neural scene representation and rendering (GQN)

Python 183 25 Updated Jun 12, 2019