tugot17

Piotr Mazurek tugot17

Making LLMs go brrr @Aleph__Alpha

128 followers · 50 following

Stars

PufferAI / PufferLib

Simplifying reinforcement learning for complex game environments

C 4,110 299 Updated Nov 6, 2025

brendanhogan / nano-grpo-envs

Python 10 1 Updated Oct 17, 2025

patrick-kidger / jaxtyping

Type annotations and runtime checking for shape and dtype of JAX/NumPy/PyTorch/etc. arrays. https://docs.kidger.site/jaxtyping/

Python 1,623 80 Updated Oct 3, 2025

srush / Tensor-Puzzles

Solve puzzles. Improve your pytorch.

Jupyter Notebook 3,766 336 Updated Jul 15, 2024

ServiceNow / PipelineRL

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 287 27 Updated Nov 6, 2025

NousResearch / atropos

Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments

Python 738 165 Updated Nov 7, 2025

deepseek-ai / DeepSeek-V3.2-Exp

Python 968 67 Updated Oct 2, 2025

PRIME-RL / PRIME

Scalable RL solution for advanced reasoning of language models

Python 1,765 99 Updated Mar 18, 2025

alexarmbr / matmul-playground

Cuda 19 5 Updated Apr 7, 2025

thinking-machines-lab / batch_invariant_ops

Python 883 68 Updated Nov 4, 2025

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 16,974 1,294 Updated Nov 3, 2025

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes/codes for ML SYS.

Python 4,078 248 Updated Nov 6, 2025

NovaSky-AI / SkyRL

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,160 161 Updated Nov 7, 2025

LeonGuertler / TextArena

A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning

Python 302 69 Updated Oct 29, 2025

gepa-ai / gepa

Optimize prompts, code, and more with AI-powered Reflective Text Evolution

Jupyter Notebook 1,495 107 Updated Nov 6, 2025

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 2,393 244 Updated Nov 7, 2025

PrimeIntellect-ai / prime-rl

Async RL Training at Scale

Python 748 125 Updated Nov 7, 2025

JannikSt / ibtop

Real-time terminal monitor for InfiniBand networks - htop for high-speed interconnects

Rust 45 1 Updated Sep 3, 2025

Tencent-Hunyuan / HunyuanWorld-Voyager

Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.

Python 1,321 124 Updated Oct 22, 2025

ChenmienTan / RL2

Python 910 96 Updated Nov 7, 2025

PrimeIntellect-ai / prime-environments

Training-Ready RL Environments + Evals

Python 164 181 Updated Nov 7, 2025

Tencent-Hunyuan / Hunyuan-GameCraft-1.0

Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition

Python 617 68 Updated Oct 16, 2025

pytorch / torchtitan

A PyTorch native platform for training generative AI models

Python 4,656 596 Updated Nov 7, 2025

mlfoundations / evalchemy

Automatic evals for LLMs

HTML 552 67 Updated Jun 27, 2025

PrimeIntellect-ai / verifiers

Environments for LLM Reinforcement Learning

Python 3,462 426 Updated Nov 7, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,193 2,439 Updated Nov 6, 2025

Aleph-Alpha / vllm

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 17 Updated Oct 2, 2025

QwenLM / qwen-code

BerriAI / litellm

musistudio / claude-code-router