Skip to content
View olliestanley's full-sized avatar

Highlights

  • Pro

Organizations

@scaleapi @open-thought

Block or report olliestanley

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

AI Safety

1 repository

Anomaly Detection

Related to machine learning for anomaly detection
7 repositories

API Programming

Related to API/backend programming
5 repositories

Audio Manipulation

Related to audio generation, compression, and more with machine learning
10 repositories

Autonomous Agents

Related to using AI to create autonomous agents
7 repositories

Code Generation

Related to machine learning for generating code
5 repositories

Computer Vision

Related to computer vision with machine learning
45 repositories

Data Manipulation

Related to exploring and processing data
21 repositories

Datasets

Repositories containing (links to) datasets
12 repositories

Development Tools

Related to easing development of machine learning projects
35 repositories

Economics and Finance

Related to computational methods for economics and finance
5 repositories

Football Analytics

Relating to analytics with football (soccer) data
12 repositories

Game Development

Related to game development, primarily using AI tools
3 repositories

Graph ML

Related to machine learning for graph data
5 repositories

Guides and Demos

Related to learning ML and Python
30 repositories

Hardware

GPUs & other hardware
1 repository

Image Generation

Related to generating images using neural networks
23 repositories

Java Libraries

Java libraries I have found useful in the past
18 repositories

Large Language Models

Related to large language model (LLM) implementations
57 repositories

Memory

1 repository

ML Deployment

Related to deploying machine learning models
22 repositories

Model Compression

Related to machine learning model compression
13 repositories

Model Explainability

Related to explaining machine learning models
14 repositories

Modeling Libraries

Relating to code for machine learning or statistical models
38 repositories

Natural Language Processing

Related to NLP-based machine learning (excludes LLM implementations)
27 repositories

Other Generation

Related to generating non-image/audio outputs using neural networks (video, DNA, ...)
9 repositories

Publishing and Management

Related to project management and publishing
3 repositories

Reinforcement Learning

Related to reinforcement learning
17 repositories

Retrieval

4 repositories

Robotics ML

Related to machine learning for robotics
2 repositories

Statistics and Causal Inference

Related to causal inference or statistical tools outside machine learning
10 repositories

Time Series

Related to machine learning for time series problems
25 repositories

Starred repositories

Showing results

Living memory for AI

Python 381 47 Updated Dec 31, 2025

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 1,340 208 Updated May 16, 2026

Open-source implementation of AlphaEvolve

Python 6,293 1,011 Updated Mar 18, 2026

Train your Agent model via our easy and efficient framework

Python 1,752 163 Updated Dec 5, 2025

A GPU cluster manager that configures and orchestrates inference engines like vLLM and SGLang for high-performance AI model deployment.

Python 5,018 528 Updated May 16, 2026

Production-ready platform for agentic workflow development.

TypeScript 141,592 22,235 Updated May 16, 2026

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

Python 36,473 2,511 Updated May 15, 2026

Fast Multimodal Semantic Deduplication & Filtering

Python 924 56 Updated May 4, 2026

Puffing up reinforcement learning

C 5,687 453 Updated May 14, 2026

Agentic RL Training at Scale

Python 1,375 289 Updated May 16, 2026

Our library for RL environments + evals

Python 4,112 546 Updated May 16, 2026

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,857 322 Updated May 16, 2026

Port of OpenAI's Whisper model in C/C++

C++ 49,753 5,542 Updated May 15, 2026

🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning

Python 24,054 4,543 Updated May 16, 2026

DFloat11 [NeurIPS '25]: Lossless Compression of LLMs and DiTs for Efficient GPU Inference

Python 630 37 Updated Nov 24, 2025

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,660 224 Updated Apr 14, 2026

Machine Learning Engineering Open Book

Python 17,935 1,140 Updated Mar 16, 2026

📰 Must-read papers on KV Cache Compression (constantly updating 🤗).

706 25 Updated Apr 15, 2026

SOTA search powered LLM

Python 3,819 340 Updated Apr 4, 2025

Official Implementation of "KBLaM: Knowledge Base augmented Language Model"

Jupyter Notebook 1,445 121 Updated Apr 20, 2026

Stateful LLM Serving

Python 102 16 Updated Mar 11, 2025

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 16,191 1,524 Updated May 9, 2026

GPU documentation for humans

Python 601 77 Updated Mar 24, 2026

Network Analysis in Python

Python 16,913 3,508 Updated May 14, 2026

[NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"

Python 694 59 Updated Mar 16, 2025

Muon is an optimizer for hidden layers in neural networks

Python 2,586 119 Updated Jan 19, 2026

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,652 1,034 Updated Apr 30, 2026

Materials for learning SGLang

820 63 Updated Jan 5, 2026

Fully open reproduction of DeepSeek-R1

Python 26,019 2,419 Updated Apr 2, 2026

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,422 119 Updated Apr 17, 2026
Next