Skip to content
View zetaqubit's full-sized avatar

Block or report zetaqubit

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 583 59 Updated Jul 11, 2024

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Python 2,956 221 Updated Mar 8, 2024

Inference Llama 2 in one file of pure C

C 19,644 2,565 Updated Aug 6, 2024

Making Reddit data accessible to researchers, moderators and everyone else. Interact with the data through large dumps, an API or web interface.

TypeScript 1,139 71 Updated Jun 11, 2026

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 9,976 1,104 Updated Apr 20, 2026

C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.

C++ 1,463 141 Updated Jun 17, 2026

A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer

Python 1,663 169 Updated Sep 15, 2023

This is optimized firmware for Ender3 V2/S1 3D printers.

C++ 3,141 434 Updated Jun 5, 2026

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 14,567 1,006 Updated Jun 13, 2026

Example scripts for the pushshift dump files

Python 492 86 Updated Jun 14, 2026

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…

Python 3,900 843 Updated May 29, 2022

Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)

Python 9,524 399 Updated May 31, 2026

Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters

Jupyter Notebook 3,848 649 Updated May 25, 2024

2D Game Physics for Python

Python 510 94 Updated Nov 29, 2024

PyTorch Tutorial for Deep Learning Researchers

Python 32,387 8,242 Updated Aug 15, 2023

Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL

Jupyter Notebook 3,178 591 Updated Nov 4, 2021

The hacker's browser.

JavaScript 26,601 2,584 Updated May 26, 2026