nkalyanv99

nkalyanv99

1 follower · 0 following

Achievements

Stars

amazon-far / holosoma-extensions

Python 37 5 Updated Dec 1, 2025

nkalyanv99 / UNI-D2

Python 47 4 Updated Dec 21, 2025

pengzhangzhi / Open-dLLM

The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.

Python 488 34 Updated Nov 11, 2025

taodav / pobax

Partially Observable Benchmarks in JAX

Python 21 4 Updated Dec 1, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,665 2,861 Updated Dec 21, 2025

TheodoreWolf / hyperoptax

Parallel hyperparameter tuning with JAX

Python 38 Updated Jul 21, 2025

PiotrNawrot / sparse-frontier

The evaluation framework for training-free sparse attention in LLMs

Python 106 8 Updated Oct 13, 2025

Simple-Efficient / RL-Factory

Train your Agent model via our easy and efficient framework

Python 1,666 156 Updated Dec 5, 2025

mll-lab-nu / RAGEN

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Jupyter Notebook 2,447 194 Updated Dec 3, 2025

PrimeIntellect-ai / verifiers

Our library for RL environments + evals

Python 3,654 453 Updated Dec 21, 2025

EdanToledo / Stoix

🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL

Python 376 44 Updated Oct 29, 2025

DramaCow / jaxued

Python 89 12 Updated Sep 9, 2025

meta-pytorch / LeanRL

LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.

Python 663 28 Updated Aug 22, 2025

unslothai / unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 49,705 4,100 Updated Dec 20, 2025

luchris429 / purejaxrl

Really Fast End-to-End Jax RL Implementations

Python 1,005 82 Updated Sep 9, 2024

epignatelli / navix

Accelerated minigrid environments with JAX

Python 153 21 Updated Oct 20, 2025

CleanDiffuserTeam / CleanDiffuser

CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making

Jupyter Notebook 673 65 Updated Apr 20, 2025

PatrickHua / SimSiam

A pytorch implementation for paper 'Exploring Simple Siamese Representation Learning'

Python 836 135 Updated Jul 14, 2022

tinygrad / tinygrad

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 30,908 3,787 Updated Dec 21, 2025

PufferAI / PufferLib

Simplifying reinforcement learning for complex game environments

C 4,642 345 Updated Dec 19, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

nkalyanv99

Achievements

Achievements

Block or report nkalyanv99

Stars

amazon-far / holosoma-extensions

nkalyanv99 / UNI-D2

pengzhangzhi / Open-dLLM

taodav / pobax

volcengine / verl

TheodoreWolf / hyperoptax

PiotrNawrot / sparse-frontier

Simple-Efficient / RL-Factory

mll-lab-nu / RAGEN

PrimeIntellect-ai / verifiers

EdanToledo / Stoix

DramaCow / jaxued

meta-pytorch / LeanRL

unslothai / unsloth

luchris429 / purejaxrl

epignatelli / navix

CleanDiffuserTeam / CleanDiffuser

PatrickHua / SimSiam

tinygrad / tinygrad

PufferAI / PufferLib