nkalyanv99

nkalyanv99

2 followers · 0 following

Achievements

Stars

24 results for source starred repositories

Clear filter

amirgholami / PyHessian

PyHessian is a Pytorch library for second-order based analysis and training of Neural Networks

Jupyter Notebook 773 124 Updated Jul 10, 2025

ESHyperscale / HyperscaleES

Jax Codebase for Evolutionary Strategies at the Hyperscale

Python 218 18 Updated Dec 25, 2025

p-doom / crowd-code

Crowdsourcing months-long human software engineering trajectories.

TypeScript 3 Updated Jan 16, 2026

RLE-Foundation / Plasticine

Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning.

Python 35 3 Updated Jan 23, 2026

amazon-far / holosoma-extensions

Python 38 5 Updated Dec 1, 2025

nkalyanv99 / UNI-D2

Python 53 6 Updated Jan 23, 2026

pengzhangzhi / Open-dLLM

The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.

Python 511 38 Updated Nov 11, 2025

taodav / pobax

Partially Observable Benchmarks in JAX

Python 21 5 Updated Feb 4, 2026

verl-project / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 18,979 3,182 Updated Feb 4, 2026

TheodoreWolf / hyperoptax

Parallel hyperparameter tuning with JAX

Python 39 Updated Jul 21, 2025

PiotrNawrot / sparse-frontier

The evaluation framework for training-free sparse attention in LLMs

Python 116 10 Updated Jan 27, 2026

Simple-Efficient / RL-Factory

Train your Agent model via our easy and efficient framework

Python 1,702 159 Updated Dec 5, 2025

mll-lab-nu / RAGEN

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,507 205 Updated Jan 25, 2026

PrimeIntellect-ai / verifiers

Our library for RL environments + evals

Python 3,798 490 Updated Feb 4, 2026

EdanToledo / Stoix

🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL

Python 387 48 Updated Oct 29, 2025

DramaCow / jaxued

Python 91 14 Updated Jan 21, 2026

meta-pytorch / LeanRL

LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.

Python 669 28 Updated Aug 22, 2025

unslothai / unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

Python 51,591 4,264 Updated Feb 4, 2026

luchris429 / purejaxrl

Really Fast End-to-End Jax RL Implementations

Python 1,017 83 Updated Sep 9, 2024

epignatelli / navix

Accelerated minigrid environments with JAX

Python 156 21 Updated Oct 20, 2025

CleanDiffuserTeam / CleanDiffuser

CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making

Jupyter Notebook 695 68 Updated Apr 20, 2025

PatrickHua / SimSiam

A pytorch implementation for paper 'Exploring Simple Siamese Representation Learning'

Python 835 135 Updated Jul 14, 2022

tinygrad / tinygrad

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 31,298 3,871 Updated Feb 4, 2026

PufferAI / PufferLib

Simplifying reinforcement learning for complex game environments

C 4,968 383 Updated Feb 4, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

nkalyanv99

Achievements

Achievements

Block or report nkalyanv99

Stars

amirgholami / PyHessian

ESHyperscale / HyperscaleES

p-doom / crowd-code

RLE-Foundation / Plasticine

amazon-far / holosoma-extensions

nkalyanv99 / UNI-D2

pengzhangzhi / Open-dLLM

taodav / pobax

verl-project / verl

TheodoreWolf / hyperoptax

PiotrNawrot / sparse-frontier

Simple-Efficient / RL-Factory

mll-lab-nu / RAGEN

PrimeIntellect-ai / verifiers

EdanToledo / Stoix

DramaCow / jaxued

meta-pytorch / LeanRL

unslothai / unsloth

luchris429 / purejaxrl

epignatelli / navix

CleanDiffuserTeam / CleanDiffuser

PatrickHua / SimSiam

tinygrad / tinygrad

PufferAI / PufferLib