Stars
Really Fast End-to-End JAX RL Implementations
A nicer way to view SEC 13F filings data
An educational resource to help anyone learn deep reinforcement learning.
Qwen Code is a coding agent that lives in the digital world.
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
An open-source AI agent that brings the power of Gemini directly into your terminal.
A runtime for writing reliable asynchronous applications with Rust. Provides I/O, networking, scheduling, timers, ...
Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
ripgrep recursively searches directories for a regex pattern while respecting your gitignore
Examples and guides for using the OpenAI API
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Unsupervised text tokenizer for Neural Network-based text generation.
Code for the paper "Language Models are Unsupervised Multitask Learners"
A PyTorch implementation of the Transformer model in "Attention is All You Need".
A collection of simple Python mini projects to enhance your Python skills
Package gorilla/websocket is a fast, well-tested and widely used WebSocket implementation for Go.
A toolkit with common assertions and mocks that plays nicely with the standard library
A set of examples around PyTorch in Vision, Text, Reinforcement Learning, etc.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Fully open reproduction of DeepSeek-R1
A DLNA, UPnP and HTTP(S) Media Server.
SBX: Stable Baselines Jax (SB3 + Jax) RL algorithms
This repo contains the Hugging Face Deep Reinforcement Learning Course.
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.