This training offers an intensive exploration into the frontier of reinforcement learning techniques with large language models (LLMs). We will explore advanced topics such as Reinforcement Learnin…

Jupyter Notebook 59 34 Updated Mar 9, 2026

datanizing / oreilly-finetuning-llm

Jupyter Notebook 33 31 Updated Mar 15, 2026

sinanuozdemir / oreilly-evaluating-llms

Metrics, Benchmarks, and Practical Tools for Assessing Large Language Models

26 18 Updated Feb 16, 2025

sinanuozdemir / oreilly-ai-agents

An introduction to the world of AI Agents

Jupyter Notebook 265 194 Updated Mar 6, 2026

PacktPublishing / Deep-Reinforcement-Learning-Hands-On-Third-Edition

Deep Reinforcement Learning Hands-On, 3E_Published by Packt

Jupyter Notebook 433 168 Updated Mar 2, 2026

mimoralea / gdrl

Grokking Deep Reinforcement Learning

Jupyter Notebook 1,012 277 Updated Feb 4, 2022

zju3dv / GVHMR

Code for "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates", Siggraph Asia 2024

Jupyter Notebook 1,496 177 Updated Jul 14, 2025

josancamon19 / rl-scaling-laws

qwen3-base family of models RL on gsm8k using verl, is there an RL power law on downstream tasks?

Python 27 1 Updated Oct 19, 2025

Shubhamsaboo / awesome-llm-apps

Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.

Python 104,910 15,299 Updated Apr 1, 2026

SWE-agent / mini-swe-agent

The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >74% on SWE-bench verified!

Python 3,747 516 Updated Apr 6, 2026

google-gemini / gemini-cli

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 100,737 12,996 Updated Apr 9, 2026

humanlayer / 12-factor-agents

What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?

TypeScript 19,181 1,457 Updated Sep 21, 2025

NVIDIA / TensorRT-LLM

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 13,324 2,267 Updated Apr 9, 2026

pixeltable / pixeltable

Data Infrastructure providing a declarative, incremental approach for multimodal AI workloads.

Python 1,621 206 Updated Apr 9, 2026

ryanburgess / engineer-manager

A list of engineering manager resource links.

JavaScript 10,696 644 Updated Mar 2, 2026

rhasspy / piper

A fast, local neural text to speech system

C++ 10,783 946 Updated Aug 26, 2025

browser-use / browser-use

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 86,785 10,018 Updated Apr 9, 2026

addamit

Lists (15)

DL

Electronic

Engg

Engineering

Gpu

LLM

llmapp

LLMD

LLMoh

pyapps

PythonDynamic

rag

RL

Robotically

Stable Meditations

Starred repositories

text-detection