tugot17

Follow

Piotr Mazurek tugot17

Follow

Making LLMs go brrr @Aleph__Alpha

128 followers · 50 following

Achievements

Achievements

Stars

203 results for source starred repositories

PufferAI / PufferLib

Simplifying reinforcement learning for complex game environments

C 4,146 301 Updated Nov 9, 2025

brendanhogan / nano-grpo-envs

Python 10 1 Updated Oct 17, 2025

patrick-kidger / jaxtyping

Type annotations and runtime checking for shape and dtype of JAX/NumPy/PyTorch/etc. arrays. https://docs.kidger.site/jaxtyping/

Python 1,627 80 Updated Oct 3, 2025

srush / Tensor-Puzzles

Solve puzzles. Improve your pytorch.

Jupyter Notebook 3,770 336 Updated Jul 15, 2024

ServiceNow / PipelineRL

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 293 28 Updated Nov 8, 2025

NousResearch / atropos

Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments

Python 739 167 Updated Nov 7, 2025

deepseek-ai / DeepSeek-V3.2-Exp

Python 969 69 Updated Oct 2, 2025

PRIME-RL / PRIME

Scalable RL solution for advanced reasoning of language models

Python 1,766 99 Updated Mar 18, 2025

alexarmbr / matmul-playground

Cuda 19 5 Updated Apr 7, 2025

thinking-machines-lab / batch_invariant_ops

Python 887 68 Updated Nov 4, 2025

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,016 1,297 Updated Nov 9, 2025

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes/codes for ML SYS.

Python 4,094 250 Updated Nov 6, 2025

NovaSky-AI / SkyRL

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,170 163 Updated Nov 9, 2025

LeonGuertler / TextArena

A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning

Python 302 71 Updated Oct 29, 2025

gepa-ai / gepa

Optimize prompts, code, and more with AI-powered Reflective Text Evolution

Jupyter Notebook 1,501 107 Updated Nov 8, 2025

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 2,421 245 Updated Nov 7, 2025

PrimeIntellect-ai / prime-rl

Async RL Training at Scale

Python 750 129 Updated Nov 8, 2025

JannikSt / ibtop

Real-time terminal monitor for InfiniBand networks - htop for high-speed interconnects

Rust 45 1 Updated Sep 3, 2025

Tencent-Hunyuan / HunyuanWorld-Voyager

Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.

Python 1,330 125 Updated Oct 22, 2025

ChenmienTan / RL2

Python 911 96 Updated Nov 9, 2025

PrimeIntellect-ai / prime-environments

Training-Ready RL Environments + Evals

Python 165 182 Updated Nov 9, 2025

Tencent-Hunyuan / Hunyuan-GameCraft-1.0

Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition

Python 617 68 Updated Oct 16, 2025

pytorch / torchtitan

A PyTorch native platform for training generative AI models

Python 4,663 598 Updated Nov 9, 2025

mlfoundations / evalchemy

Automatic evals for LLMs

HTML 553 67 Updated Jun 27, 2025

PrimeIntellect-ai / verifiers

Environments for LLM Reinforcement Learning

Python 3,470 428 Updated Nov 9, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,263 2,454 Updated Nov 9, 2025

QwenLM / qwen-code

Qwen Code is a coding agent that lives in the digital world.

TypeScript 15,184 1,254 Updated Nov 9, 2025

BerriAI / litellm

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python 30,848 4,635 Updated Nov 9, 2025

musistudio / claude-code-router

Use Claude Code as the foundation for coding infrastructure, allowing you to decide how to interact with the model while enjoying updates from Anthropic.

TypeScript 21,227 1,637 Updated Nov 6, 2025

supabase / cli

Supabase CLI. Manage postgres migrations, run Supabase locally, deploy edge functions. Postgres backups. Generating types from your database schema.

Go 1,457 311 Updated Nov 8, 2025