olliestanley

Oliver Stanley olliestanley

ML @ Scale AI

73 followers · 14 following

Starred repositories

Sean-V-Dev / HMLR-Agentic-AI-Memory-System

Living memory for AI

Python · 381 stars · 47 forks · Updated Dec 31, 2025

radixark / miles

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python · 1,340 stars · 208 forks · Updated May 16, 2026

algorithmicsuperintelligence / openevolve

Open-source implementation of AlphaEvolve

Python · 6,293 stars · 1,011 forks · Updated Mar 18, 2026

Simple-Efficient / RL-Factory

Train your Agent model via our easy and efficient framework

Python · 1,752 stars · 163 forks · Updated Dec 5, 2025

gpustack / gpustack

A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.

Python · 5,018 stars · 528 forks · Updated May 16, 2026

langgenius / dify

Production-ready platform for agentic workflow development.

TypeScript · 141,592 stars · 22,235 forks · Updated May 16, 2026

google / langextract

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

Python · 36,473 stars · 2,511 forks · Updated May 15, 2026

MinishLab / semhash

Fast Multimodal Semantic Deduplication & Filtering

Python · 924 stars · 56 forks · Updated May 4, 2026

PufferAI / PufferLib

Puffing up reinforcement learning

C · 5,687 stars · 453 forks · Updated May 14, 2026

PrimeIntellect-ai / prime-rl

Agentic RL Training at Scale

Python · 1,375 stars · 289 forks · Updated May 16, 2026

PrimeIntellect-ai / verifiers

Our library for RL environments + evals

Python · 4,112 stars · 546 forks · Updated May 16, 2026

NovaSky-AI / SkyRL

SkyRL: A Modular Full-stack RL Library for LLMs

Python · 1,857 stars · 322 forks · Updated May 16, 2026

ggml-org / whisper.cpp

Port of OpenAI's Whisper model in C/C++

C++ · 49,753 stars · 5,542 forks · Updated May 15, 2026

huggingface / lerobot

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python · 24,054 stars · 4,543 forks · Updated May 16, 2026

LeanModels / DFloat11

DFloat11 [NeurIPS '25]: Lossless Compression of LLMs and DiTs for Efficient GPU Inference

Python · 630 stars · 37 forks · Updated Nov 24, 2025

mll-lab-nu / RAGEN

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python · 2,660 stars · 224 forks · Updated Apr 14, 2026

stas00 / ml-engineering

Machine Learning Engineering Open Book

Python · 17,935 stars · 1,140 forks · Updated Mar 16, 2026

October2001 / Awesome-KV-Cache-Compression

📰 Must-read papers on KV Cache Compression (constantly updating 🤗).

706 stars · 25 forks · Updated Apr 15, 2026

sentient-agi / OpenDeepSearch

SOTA search powered LLM

Python · 3,819 stars · 340 forks · Updated Apr 4, 2025

microsoft / KBLaM

Official Implementation of "KBLaM: Knowledge Base augmented Language Model"

Jupyter Notebook · 1,445 stars · 121 forks · Updated Apr 20, 2026

WukLab / preble

Stateful LLM Serving

Python · 102 stars · 16 forks · Updated Mar 11, 2025

MathFoundationRL / Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB · 16,191 stars · 1,524 forks · Updated May 9, 2026

modal-labs / gpu-glossary

GPU documentation for humans

Python · 601 stars · 77 forks · Updated Mar 24, 2026

networkx / networkx

Network Analysis in Python

Python · 16,913 stars · 3,508 forks · Updated May 14, 2026

facebookresearch / swe-rl

[NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"

Python · 694 stars · 59 forks · Updated Mar 16, 2025

KellerJordan / Muon

Muon is an optimizer for hidden layers in neural networks

Python · 2,586 stars · 119 forks · Updated Jan 19, 2026

deepseek-ai / FlashMLA

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ · 12,652 stars · 1,034 forks · Updated Apr 30, 2026

sgl-project / sgl-learning-materials

Materials for learning SGLang

820 stars · 63 forks · Updated Jan 5, 2026

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python · 26,019 stars · 2,419 forks · Updated Apr 2, 2026

open-thought / reasoning-gym

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python · 1,422 stars · 119 forks · Updated Apr 17, 2026

Terminal

TensorFlow

SQL

Python

Natural language processing

Machine learning

Java

Deployment

Deep learning

Database

Lists (32)

AI Safety

Anomaly Detection

API Programming

Audio Manipulation

Autonomous Agents

Code Generation

Computer Vision

Data Manipulation

Datasets

Development Tools

Economics and Finance

Football Analytics

Game Development

Graph ML

Guides and Demos

Hardware

Image Generation

Java Libraries

Large Language Models

Memory

ML Deployment

Model Compression

Model Explainability

Modeling Libraries

Natural Language Processing

Other Generation

Publishing and Management

Reinforcement Learning

Retrieval

Robotics ML

Statistics and Causal Inference

Time Series