tugot17

Piotr Mazurek tugot17

Making LLMs go brrr @Aleph__Alpha

130 followers · 51 following

Stars

PufferAI / PufferLib

Simplifying reinforcement learning for complex game environments

C · 4,198 stars · 305 forks · Updated Nov 14, 2025

brendanhogan / nano-grpo-envs

Python · 10 stars · 1 fork · Updated Oct 17, 2025

patrick-kidger / jaxtyping

Type annotations and runtime checking for shape and dtype of JAX/NumPy/PyTorch/etc. arrays. https://docs.kidger.site/jaxtyping/

Python · 1,638 stars · 80 forks · Updated Oct 3, 2025

srush / Tensor-Puzzles

Solve puzzles. Improve your pytorch.

Jupyter Notebook · 3,773 stars · 339 forks · Updated Jul 15, 2024

ServiceNow / PipelineRL

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python · 299 stars · 28 forks · Updated Nov 14, 2025

NousResearch / atropos

Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments

Python · 744 stars · 168 forks · Updated Nov 14, 2025

deepseek-ai / DeepSeek-V3.2-Exp

Python · 978 stars · 70 forks · Updated Oct 2, 2025

PRIME-RL / PRIME

Scalable RL solution for advanced reasoning of language models

Python · 1,766 stars · 99 forks · Updated Mar 18, 2025

alexarmbr / matmul-playground

Cuda · 19 stars · 5 forks · Updated Apr 7, 2025

thinking-machines-lab / batch_invariant_ops

Python · 895 stars · 68 forks · Updated Nov 4, 2025

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python · 17,154 stars · 1,302 forks · Updated Nov 10, 2025

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes/codes for ML SYS.

Python · 4,156 stars · 253 forks · Updated Nov 10, 2025

NovaSky-AI / SkyRL

SkyRL: A Modular Full-stack RL Library for LLMs

Python · 1,196 stars · 167 forks · Updated Nov 13, 2025

LeonGuertler / TextArena

A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning

Python · 310 stars · 75 forks · Updated Oct 29, 2025

gepa-ai / gepa

Optimize prompts, code, and more with AI-powered Reflective Text Evolution

Jupyter Notebook · 1,551 stars · 113 forks · Updated Nov 12, 2025

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python · 2,472 stars · 253 forks · Updated Nov 14, 2025

PrimeIntellect-ai / prime-rl

Async RL Training at Scale

Python · 768 stars · 133 forks · Updated Nov 14, 2025

JannikSt / ibtop

Real-time terminal monitor for InfiniBand networks - htop for high-speed interconnects

Rust · 45 stars · 1 fork · Updated Sep 3, 2025

Tencent-Hunyuan / HunyuanWorld-Voyager

Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.

Python · 1,339 stars · 128 forks · Updated Oct 22, 2025

ChenmienTan / RL2

Python · 915 stars · 97 forks · Updated Nov 13, 2025

PrimeIntellect-ai / prime-environments

Training-Ready RL Environments + Evals

Python · 174 stars · 187 forks · Updated Nov 14, 2025

Tencent-Hunyuan / Hunyuan-GameCraft-1.0

Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition

Python · 617 stars · 68 forks · Updated Oct 16, 2025

pytorch / torchtitan

A PyTorch native platform for training generative AI models

Python · 4,706 stars · 601 forks · Updated Nov 14, 2025

mlfoundations / evalchemy

Automatic evals for LLMs

HTML · 557 stars · 68 forks · Updated Jun 27, 2025

PrimeIntellect-ai / verifiers

Environments for LLM Reinforcement Learning

Python · 3,485 stars · 431 forks · Updated Nov 14, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python · 15,627 stars · 2,522 forks · Updated Nov 14, 2025

Aleph-Alpha / vllm

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 17 Updated Nov 13, 2025

QwenLM / qwen-code

BerriAI / litellm

musistudio / claude-code-router