Skip to content
View nkalyanv99's full-sized avatar

Block or report nkalyanv99

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
24 results for source starred repositories
Clear filter

PyHessian is a Pytorch library for second-order based analysis and training of Neural Networks

Jupyter Notebook 773 124 Updated Jul 10, 2025

Jax Codebase for Evolutionary Strategies at the Hyperscale

Python 218 18 Updated Dec 25, 2025

Crowdsourcing months-long human software engineering trajectories.

TypeScript 3 Updated Jan 16, 2026

Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning.

Python 35 3 Updated Jan 23, 2026
Python 53 6 Updated Jan 23, 2026

The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.

Python 511 38 Updated Nov 11, 2025

Partially Observable Benchmarks in JAX

Python 21 5 Updated Feb 4, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 18,979 3,182 Updated Feb 4, 2026

Parallel hyperparameter tuning with JAX

Python 39 Updated Jul 21, 2025

The evaluation framework for training-free sparse attention in LLMs

Python 116 10 Updated Jan 27, 2026

Train your Agent model via our easy and efficient framework

Python 1,702 159 Updated Dec 5, 2025

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,507 205 Updated Jan 25, 2026

Our library for RL environments + evals

Python 3,798 490 Updated Feb 4, 2026

🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL

Python 387 48 Updated Oct 29, 2025
Python 91 14 Updated Jan 21, 2026

LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.

Python 669 28 Updated Aug 22, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

Python 51,591 4,264 Updated Feb 4, 2026

Really Fast End-to-End Jax RL Implementations

Python 1,017 83 Updated Sep 9, 2024

Accelerated minigrid environments with JAX

Python 156 21 Updated Oct 20, 2025

CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making

Jupyter Notebook 695 68 Updated Apr 20, 2025

A pytorch implementation for paper 'Exploring Simple Siamese Representation Learning'

Python 835 135 Updated Jul 14, 2022

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 31,298 3,871 Updated Feb 4, 2026

Simplifying reinforcement learning for complex game environments

C 4,968 383 Updated Feb 4, 2026