norman0686

Norman Qing norman0686

2 followers · 2 following

Stars

luchris429 / purejaxrl

Really Fast End-to-End Jax RL Implementations

Python 1,004 82 Updated Sep 9, 2024

toddwschneider / sec-13f-filings

A nicer way to view SEC 13F filings data

Ruby 362 88 Updated Aug 1, 2024

openai / spinningup

An educational resource to help anyone learn deep reinforcement learning.

Python 11,450 2,413 Updated Aug 5, 2024

QwenLM / qwen-code

Qwen Code is a coding agent that lives in the digital world.

TypeScript 16,565 1,414 Updated Dec 20, 2025

vwxyzjn / ppo-implementation-details

The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization

Python 901 120 Updated Mar 23, 2024

google-gemini / gemini-cli

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 88,025 10,090 Updated Dec 20, 2025

rayon-rs / rayon

Rayon: A data parallelism library for Rust

Rust 12,466 565 Updated Oct 28, 2025

tokio-rs / tokio

A runtime for writing reliable asynchronous applications with Rust. Provides I/O, networking, scheduling, timers, ...

Rust 30,546 2,856 Updated Dec 18, 2025

Stable-Baselines-Team / stable-baselines3-contrib

Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code

Python 682 226 Updated Dec 8, 2025

DLR-RM / stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 12,366 2,018 Updated Dec 18, 2025

BurntSushi / ripgrep

ripgrep recursively searches directories for a regex pattern while respecting your gitignore

Rust 58,221 2,341 Updated Dec 17, 2025

openai / openai-cookbook

Examples and guides for using the OpenAI API

Jupyter Notebook 69,847 11,735 Updated Dec 19, 2025

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,187 2,684 Updated Aug 12, 2024

google / sentencepiece

Unsupervised text tokenizer for Neural Network-based text generation.

C++ 11,520 1,315 Updated Dec 18, 2025

openai / gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 24,481 5,835 Updated Aug 14, 2024

ThePrimeagen / init.lua

Lua 3,906 686 Updated Dec 11, 2025

jadore801120 / attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python 9,557 2,080 Updated Apr 16, 2024

Python-World / python-mini-projects

A collection of simple python mini projects to enhance your python skills

Python 17,419 5,777 Updated Jul 17, 2022

websockets / wscat

WebSocket cat

JavaScript 2,628 248 Updated May 3, 2025

gorilla / websocket

Package gorilla/websocket is a fast, well-tested and widely used WebSocket implementation for Go.

Go 24,337 3,575 Updated Mar 19, 2025

google / go-cmp

Package for comparing Go values in tests

Go 4,567 219 Updated Feb 21, 2025

stretchr / testify

A toolkit with common assertions and mocks that plays nicely with the standard library

Go 25,545 1,686 Updated Nov 28, 2025

pytorch / examples

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

Python 23,622 9,781 Updated Sep 1, 2025

mlabonne / llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

70,733 8,094 Updated Jun 4, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,739 2,405 Updated Nov 24, 2025

UniversalMediaServer / UniversalMediaServer

A DLNA, UPnP and HTTP(S) Media Server.

Java 2,538 493 Updated Dec 17, 2025

jellyfin / Swiftfin

Native Jellyfin Client for iOS and tvOS

Swift 3,527 424 Updated Dec 20, 2025

araffin / sbx

SBX: Stable Baselines Jax (SB3 + Jax) RL algorithms

Python 540 53 Updated Dec 15, 2025

huggingface / deep-rl-class

This repo contains the Hugging Face Deep Reinforcement Learning Course.

MDX 4,666 758 Updated Oct 1, 2025

DLR-RM / rl-baselines3-zoo

A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.

Python 2,668 579 Updated Dec 15, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly