Skip to content
View williamjshipman's full-sized avatar

Block or report williamjshipman

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Tutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]

Jupyter Notebook 297 78 Updated Oct 23, 2020

Python implementation of DDQN multi-UAV data harvesting

Python 214 49 Updated Jan 13, 2022

Reinforcement learning tutorials

Python 411 157 Updated Mar 25, 2023

Deep Q-Learning (DQN) implementation for Atari pong.

Python 86 16 Updated Nov 22, 2022

A neurosymbolic perspective on LLMs

Python 1,716 86 Updated May 14, 2026

PyTorch implementations of MADDPG, MAPPO (coming)

Python 205 21 Updated Mar 6, 2024

PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.

Python 538 81 Updated Jul 21, 2023

OpenUI5 lets you build enterprise-ready web applications, responsive to all devices, running on almost any browser of your choice.

JavaScript 3,285 1,270 Updated May 17, 2026

The Open Standard for Generative UI

TypeScript 5,918 412 Updated May 16, 2026

A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy's observations on LLM coding pitfalls.

133,829 13,673 Updated Apr 20, 2026

OpenViking is an open-source context database designed specifically for AI Agents(such as openclaw). OpenViking unifies the management of context (memory, resources, and skills) that Agents need th…

Python 24,010 1,789 Updated May 17, 2026

Zotero MCP: Connects your Zotero research library with Claude and other AI assistants via the Model Context Protocol to discuss papers, get summaries, analyze citations, and more.

Python 3,165 283 Updated Apr 9, 2026

Context7 Platform -- Up-to-date code documentation for LLMs and AI code editors

TypeScript 55,488 2,634 Updated May 15, 2026

🔥 Datasets and env wrappers for offline safe reinforcement learning

Python 132 7 Updated Nov 12, 2025

Prioritized Experience Replay (PER) implementation in PyTorch

Python 360 73 Updated Feb 3, 2020

Lua/Torch implementation of DQN (Nature, 2015)

Lua 633 168 Updated Apr 6, 2017

[ICRA 2023] Adaptive and Explainable Deployment of Navigation Skills via Hierarchical Deep Reinforcement Learning

Python 137 9 Updated Jan 31, 2024

A research-validated stethoscope whose plans are available Freely and openly. The cost of the entire stethoscope is between $2.5 to $5 to produce

Ruby 993 112 Updated Apr 28, 2026

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 349,034 56,264 Updated Mar 20, 2026

📚 Freely available programming books

Python 388,466 66,306 Updated May 12, 2026

A collective list of free APIs

Python 435,509 47,740 Updated May 15, 2026

A rewrite of the old legacy software "depends.exe" in C# for Windows devs to troubleshoot dll load dependencies issues.

C# 11,493 927 Updated May 15, 2024

ASP.NET Core is a cross-platform .NET framework for building modern cloud-based web applications on Windows, Mac, or Linux.

C# 37,932 10,651 Updated May 17, 2026

Aspire is the tool for code-first, extensible, observable dev and deploy.

C# 5,946 882 Updated May 17, 2026

The OpenTelemetry .NET Client

C# 3,698 891 Updated May 13, 2026

Develop Desktop, Embedded, Mobile and WebAssembly apps with C# and XAML. The future of .NET UI

C# 30,814 2,693 Updated May 15, 2026

Simple A3C implementation with pytorch + multiprocessing

Python 657 144 Updated Mar 10, 2023

Use Codex from Claude Code to review code or delegate tasks.

JavaScript 18,903 1,111 Updated Apr 18, 2026

An OpenaAIGym-based framework allowing to test hybrid approaches (RL + path planning) for multi-UAV systems that are supposed to provide smart services.

Python 147 17 Updated Dec 19, 2023
TypeScript 7,149 892 Updated May 17, 2026
Next