jyakaranda
Python 339 14 Updated Feb 10, 2026
Python 1,661 277 Updated Apr 3, 2026

This repository contains a curated collection of 300+ case studies from over 80 companies, detailing practical applications and insights into machine learning (ML) system design. The contents are o…

10,029 1,522 Updated Aug 5, 2025

i2LQR: Iterative LQR for Iterative Tasks in Dynamic Environments (CDC 2023) https://arxiv.org/abs/2302.14246

Python 21 1 Updated Mar 18, 2024

This repository contains multiple approaches for generating global race trajectories.

Python 571 230 Updated Jul 6, 2023

[ICLR 2026] Plan-R1: Safe and Feasible Trajectory Planning as Language Modeling

Python 110 29 Updated Jan 28, 2026

A lightweight deep learning training framework implemented from scratch in C++, featuring a PyTorch-style API.

C++ 168 26 Updated Apr 4, 2026

Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, QLearning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Pape…

Jupyter Notebook 792 187 Updated Jan 22, 2019

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 24,747 5,871 Updated Aug 14, 2024

A Gym for Agentic LLMs

Python 476 31 Updated Jan 21, 2026

The best ChatGPT that $100 can buy.

Python 51,515 6,820 Updated Mar 27, 2026

Deep Reinforcement Learning

4,575 677 Updated Dec 10, 2022

Wife-approved HomeOps driven by Kubernetes and GitOps using Flux

YAML 2,775 219 Updated Apr 9, 2026

My GitOps-managed home Kubernetes cluster... and more! ⛵

Just 111 Updated Apr 10, 2026

GRPO training code which scales to 32xH100s for long horizon terminal/coding tasks. Base agent is now the top Qwen3 agent on Stanford's TerminalBench leaderboard.

Python 366 23 Updated Aug 24, 2025

A benchmark for LLMs on complicated tasks in the terminal

Python 1,940 499 Updated Jan 22, 2026

Lightweight coding agent that runs in your terminal

Rust 74,241 10,477 Updated Apr 10, 2026

A C library for creating Excel XLSX files.

C 1,733 383 Updated Jan 6, 2026

GPU documentation for humans

Python 558 69 Updated Mar 24, 2026

💫 Toolkit to help you get started with Spec-Driven Development

Python 86,675 7,445 Updated Apr 9, 2026

DELT: Data Efficacy for Language Model Training

Python 45 5 Updated Feb 12, 2026

Nano vLLM

Python 12,776 1,900 Updated Nov 3, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads. AntRay is forked from ray, offering incremental new features on top …

Python 168 27 Updated Mar 21, 2026

slime is an LLM post-training framework for RL Scaling.

Python 5,224 707 Updated Apr 9, 2026

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…

Python 13,623 1,338 Updated Apr 9, 2026

Trae Agent is an LLM-based agent for general purpose software engineering tasks.

Python 11,303 1,227 Updated Feb 5, 2026

Spark RAPIDS plugin - accelerate Apache Spark with GPUs

Scala 973 282 Updated Apr 9, 2026

AGENTS.md — a simple, open format for guiding coding agents

TypeScript 19,969 1,447 Updated Mar 12, 2026

Open source software for autonomous drones.

C++ 3,110 456 Updated Apr 3, 2026

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 15,247 1,423 Updated Mar 26, 2026