Skip to content
View norman0686's full-sized avatar

Block or report norman0686

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Really Fast End-to-End Jax RL Implementations

Python 1,004 82 Updated Sep 9, 2024

A nicer way to view SEC 13F filings data

Ruby 362 88 Updated Aug 1, 2024

An educational resource to help anyone learn deep reinforcement learning.

Python 11,450 2,413 Updated Aug 5, 2024

Qwen Code is a coding agent that lives in the digital world.

TypeScript 16,565 1,414 Updated Dec 20, 2025

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

Python 901 120 Updated Mar 23, 2024

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 88,025 10,090 Updated Dec 20, 2025

Rayon: A data parallelism library for Rust

Rust 12,466 565 Updated Oct 28, 2025

A runtime for writing reliable asynchronous applications with Rust. Provides I/O, networking, scheduling, timers, ...

Rust 30,546 2,856 Updated Dec 18, 2025

Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code

Python 682 226 Updated Dec 8, 2025

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 12,366 2,018 Updated Dec 18, 2025

ripgrep recursively searches directories for a regex pattern while respecting your gitignore

Rust 58,221 2,341 Updated Dec 17, 2025

Examples and guides for using the OpenAI API

Jupyter Notebook 69,847 11,735 Updated Dec 19, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,187 2,684 Updated Aug 12, 2024

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 11,520 1,315 Updated Dec 18, 2025

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 24,481 5,835 Updated Aug 14, 2024

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python 9,557 2,080 Updated Apr 16, 2024

A collection of simple python mini projects to enhance your python skills

Python 17,419 5,777 Updated Jul 17, 2022

WebSocket cat

JavaScript 2,628 248 Updated May 3, 2025

Package gorilla/websocket is a fast, well-tested and widely used WebSocket implementation for Go.

Go 24,337 3,575 Updated Mar 19, 2025

Package for comparing Go values in tests

Go 4,567 219 Updated Feb 21, 2025

A toolkit with common assertions and mocks that plays nicely with the standard library

Go 25,545 1,686 Updated Nov 28, 2025

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

Python 23,622 9,781 Updated Sep 1, 2025

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

70,733 8,094 Updated Jun 4, 2025

Fully open reproduction of DeepSeek-R1

Python 25,739 2,405 Updated Nov 24, 2025

A DLNA, UPnP and HTTP(S) Media Server.

Java 2,538 493 Updated Dec 17, 2025

Native Jellyfin Client for iOS and tvOS

Swift 3,527 424 Updated Dec 20, 2025

SBX: Stable Baselines Jax (SB3 + Jax) RL algorithms

Python 540 53 Updated Dec 15, 2025

This repo contains the Hugging Face Deep Reinforcement Learning Course.

MDX 4,666 758 Updated Oct 1, 2025

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Python 2,668 579 Updated Dec 15, 2025
Next