Skip to content
View sleexyz's full-sized avatar

Block or report sleexyz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
TypeScript 16,168 601 Updated Mar 30, 2026

Train transformer language models with reinforcement learning.

Python 17,843 2,594 Updated Mar 30, 2026

Open-source repo for applying continual learning to autoresearch with SDPO and other RL algorithms. Current example: Qwen3-14B achieves 1.023 val_bpb (−3.1%), surpassing the original Karpathy agent…

Python 3 Updated Mar 27, 2026

Official Codebase for "Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights"

Python 408 39 Updated Mar 20, 2026

Train the smallest LM you can that fits in 16MB. Best model wins!

Python 4,474 2,805 Updated Mar 28, 2026

Code for "Matching Features, Not Tokens: Energy-Based Fine-Tuning of Language Models".

Python 15 Updated Mar 16, 2026

🏝️ OASIS: Open Agent Social Interaction Simulations with One Million Agents.

Python 3,962 421 Updated Mar 20, 2026

Digital Red Queen: Adversarial Program Evolution in Core War with LLMs

Red 188 26 Updated Jan 13, 2026

Advocacy org for AI in service of New Yorkers

TypeScript 1 Updated Mar 6, 2026

A repository for major/influential FEP and active inference papers.

TeX 225 30 Updated Jun 28, 2021

A wrapper around the Craftax agent benchmark, for evaluating digital agents over extremely long time horizons

Python 7 Updated Feb 24, 2025

Ghostty-based macOS terminal with vertical tabs and notifications for AI coding agents

Swift 11,588 803 Updated Mar 30, 2026

[NeurIPS '25] Knowledge Graph Generation from Any Text

Python 1,085 160 Updated Mar 24, 2026

Turn any collection of documents into a knowledge graph. Extract entities and relationships via LLM, deduplicate with your approval. Map domains, find hidden connections, spot patterns across docum…

Python 451 37 Updated Mar 16, 2026

React components for visualizing traces from AI agents

TypeScript 316 16 Updated Feb 21, 2026

Agentic Research and Evaluation Suite

Python 86 14 Updated Mar 27, 2026

Stanford NLP Python library for benchmarking the utility of LLM interpretability methods

Python 178 32 Updated Mar 12, 2026
TypeScript 6 Updated Mar 7, 2026

Code Editor for the AI Agents Era - Run an army of Claude Code, Codex, etc. on your machine

TypeScript 8,263 617 Updated Mar 30, 2026

policy with experience

Python 63 2 Updated Feb 25, 2026

A recursive coding agent inpired by RLMs

Shell 259 26 Updated Mar 24, 2026

Implementation of Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt (NAACL'24).

Python 120 14 Updated Jan 26, 2025
TypeScript 5 Updated Jan 16, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,323 3,535 Updated Mar 30, 2026
Python 313 45 Updated Dec 12, 2025

An Autonomous Curriculum Reinforcement Learning framework that steers agents to continually learn in specific environments with zero human data.

Python 26 2 Updated Feb 25, 2026

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,721 285 Updated Mar 30, 2026

OpenClaw-RL: Train any agent simply by talking

Python 4,411 437 Updated Mar 28, 2026

RLAnything & DemyAgent: General and scalable agentic RL algorithms across terminal, GUI, SWE, and tool-call settings

Python 417 49 Updated Feb 27, 2026
Next