13 stars written in Python

AI agents running research on single-GPU nanochat training automatically

Python 65,905 9,431 Updated Mar 26, 2026

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 56,093 9,557 Updated Nov 12, 2025

Download market data from Yahoo! Finance's API

Python 22,616 3,134 Updated Apr 3, 2026

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 13,027 2,095 Updated Apr 1, 2026

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 11,644 1,312 Updated Mar 28, 2026

Ollama Python library

Python 9,707 993 Updated Jan 23, 2026

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 9,485 1,038 Updated Jul 8, 2025

StarCraft II Learning Environment

Python 8,266 1,162 Updated Jul 23, 2024

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…

Python 3,887 843 Updated May 29, 2022

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Python 3,373 448 Updated Apr 5, 2026

Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow

Python 2,017 352 Updated Mar 30, 2026

(JAIR'2022) A mini-scale reproduction code of the AlphaStar program. Note: the original AlphaStar is the AI proposed by DeepMind to play StarCraft II. JAIR = Journal of Artificial Intelligence Rese…

Python 363 59 Updated Nov 9, 2022