Skip to content
View jyakaranda's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report jyakaranda

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 339 14 Updated Feb 10, 2026
Python 1,637 265 Updated Mar 25, 2026

This repository contains a curated collection of 300+ case studies from over 80 companies, detailing practical applications and insights into machine learning (ML) system design. The contents are o…

9,791 1,452 Updated Aug 5, 2025

i2LQR: Iterative LQR for Iterative Tasks in Dynamic Environments (CDC 2023) https://arxiv.org/abs/2302.14246

Python 21 1 Updated Mar 18, 2024

This repository contains multiple approaches for generating global racetrajectories.

Python 571 231 Updated Jul 6, 2023

[ICLR 2026] Plan-R1: Safe and Feasible Trajectory Planning as Language Modeling

Python 105 27 Updated Jan 28, 2026

A tiny deep learning training framework implemented from scratch in C++ that follows PyTorch's API.

C++ 165 26 Updated Mar 26, 2026

Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Pape…

Jupyter Notebook 791 187 Updated Jan 22, 2019

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 24,730 5,869 Updated Aug 14, 2024

A Gym for Agentic LLMs

Python 474 31 Updated Jan 21, 2026

The best ChatGPT that $100 can buy.

Python 50,893 6,693 Updated Mar 27, 2026

Deep Reinforcement Learning

4,573 677 Updated Dec 10, 2022

Wife approved HomeOps driven by Kubernetes and GitOps using Flux

YAML 2,769 219 Updated Apr 2, 2026

My GitOps-managed home Kubernetes cluster... and more! ⛵

Just 111 Updated Apr 2, 2026

GRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's TerminalBench leaderboard.

Python 364 23 Updated Aug 24, 2025

A benchmark for LLMs on complicated tasks in the terminal

Python 1,850 497 Updated Jan 22, 2026

Lightweight coding agent that runs in your terminal

Rust 72,589 10,156 Updated Apr 2, 2026

A C library for creating Excel XLSX files.

C 1,732 381 Updated Jan 6, 2026

GPU documentation for humans

Python 553 67 Updated Mar 24, 2026

💫 Toolkit to help you get started with Spec-Driven Development

Python 84,764 7,263 Updated Apr 2, 2026

DELT: Data Efficacy for Language Model Training

Python 45 6 Updated Feb 12, 2026

Nano vLLM

Python 12,645 1,851 Updated Nov 3, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads. AntRay is forked from ray, offering incremental new features on top …

Python 168 27 Updated Mar 21, 2026

slime is an LLM post-training framework for RL Scaling.

Python 5,082 683 Updated Mar 29, 2026

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…

Python 13,496 1,318 Updated Apr 2, 2026

Trae Agent is an LLM-based agent for general purpose software engineering tasks.

Python 11,232 1,213 Updated Feb 5, 2026

Spark RAPIDS plugin - accelerate Apache Spark with GPUs

Scala 972 281 Updated Apr 2, 2026

AGENTS.md — a simple, open format for guiding coding agents

TypeScript 19,700 1,423 Updated Mar 12, 2026

Open source software for autonomous drones.

C++ 3,106 456 Updated Dec 18, 2025

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 15,158 1,421 Updated Mar 26, 2026
Next