Skip to content
View nkalyanv99's full-sized avatar

Block or report nkalyanv99

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 47 4 Updated Dec 21, 2025

The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.

Python 488 34 Updated Nov 11, 2025

Partially Observable Benchmarks in JAX

Python 21 4 Updated Dec 1, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,665 2,861 Updated Dec 21, 2025

Parallel hyperparameter tuning with JAX

Python 38 Updated Jul 21, 2025

The evaluation framework for training-free sparse attention in LLMs

Python 106 8 Updated Oct 13, 2025

Train your Agent model via our easy and efficient framework

Python 1,666 156 Updated Dec 5, 2025

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Jupyter Notebook 2,447 194 Updated Dec 3, 2025

Our library for RL environments + evals

Python 3,654 453 Updated Dec 21, 2025

🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL

Python 376 44 Updated Oct 29, 2025
Python 89 12 Updated Sep 9, 2025

LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.

Python 663 28 Updated Aug 22, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 49,705 4,100 Updated Dec 20, 2025

Really Fast End-to-End Jax RL Implementations

Python 1,005 82 Updated Sep 9, 2024

Accelerated minigrid environments with JAX

Python 153 21 Updated Oct 20, 2025

CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making

Jupyter Notebook 673 65 Updated Apr 20, 2025

A pytorch implementation for paper 'Exploring Simple Siamese Representation Learning'

Python 836 135 Updated Jul 14, 2022

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 30,908 3,787 Updated Dec 21, 2025

Simplifying reinforcement learning for complex game environments

C 4,642 345 Updated Dec 19, 2025