Skip to content
View andyl98's full-sized avatar
🫥
🫥
  • Roblox
  • United States
  • 10:22 (UTC -07:00)
  • LinkedIn in/andyl98

Block or report andyl98

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 3,923 279 Updated Jan 14, 2026

GPQA: A Graduate-Level Google-Proof Q&A Benchmark

Jupyter Notebook 478 49 Updated Sep 30, 2024

A PyTorch native platform for training generative AI models

Python 5,175 756 Updated Mar 23, 2026

Post-training with Tinker

Python 2,972 358 Updated Mar 23, 2026

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python 18,822 2,029 Updated Mar 23, 2026

[NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents

Python 602 114 Updated Mar 23, 2026

slime is an LLM post-training framework for RL Scaling.

Python 4,915 657 Updated Mar 23, 2026

Democratizing Reinforcement Learning for LLMs

Python 5,274 522 Updated Mar 23, 2026

Toolchain manager for Roblox projects

Rust 239 34 Updated Jan 13, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 24,915 4,958 Updated Mar 23, 2026

The open source AI engineering platform for agents, LLMs, and ML models. MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while control…

Python 24,923 5,468 Updated Mar 23, 2026

Our library for RL environments + evals

Python 3,927 520 Updated Mar 23, 2026

Lightweight coding agent that runs in your terminal

Rust 67,084 8,970 Updated Mar 23, 2026

Renderer for the harmony response format to be used with gpt-oss

Rust 4,238 262 Updated Dec 15, 2025

Copilot Chat extension for VS Code

TypeScript 9,669 1,764 Updated Mar 23, 2026

Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.

Rust 77,724 7,519 Updated Mar 23, 2026

✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framwork

Python 313 18 Updated Sep 6, 2025

This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?"

1,439 139 Updated Jul 18, 2025

Model Context Protocol Servers

TypeScript 81,865 10,043 Updated Mar 17, 2026

Train transformer language models with reinforcement learning.

Python 6 2 Updated Jul 28, 2025

TransMLA: Multi-Head Latent Attention Is All You Need (NeurIPS 2025 Spotlight)

Python 435 28 Updated Feb 28, 2026

s1: Simple test-time scaling

Python 6,650 766 Updated Jun 25, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,138 3,483 Updated Mar 23, 2026

Fully open reproduction of DeepSeek-R1

Python 25,959 2,416 Updated Nov 24, 2025

⏩ Source-controlled AI checks, enforceable in CI. Powered by the open-source Continue CLI

TypeScript 32,006 4,294 Updated Mar 23, 2026

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 9,226 903 Updated Mar 23, 2026

Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

Python 6,494 791 Updated Jan 14, 2026

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 18,064 2,909 Updated Nov 3, 2025

The Open Cookbook for Top-Tier Code Large Language Model

Python 2,062 122 Updated Dec 8, 2024
Next