Stars
Lightweight, open-source AI agent for your tools, chats, and workflows.
An agent-managed museum exhibit, built in Rust with Gajae-Code / LazyCodex — developed and maintained with no human intervention.
higher is a PyTorch library allowing users to obtain higher-order gradients over losses spanning training loops rather than individual training steps.
Symphony turns project work into isolated, autonomous implementation runs, allowing teams to manage work instead of supervising coding agents.
📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
[NeurIPS 2025] TTRL: Test-Time Reinforcement Learning
From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.
Open-source implementation of AlphaEvolve
GenAI for Optimization and Decision Intelligence
Recent research papers about Foundation Models for Combinatorial Optimization
A Collection on Large Language Models for Optimization
LLM4AD: A Platform for Algorithm Design with Large Language Model
Search-R1: An Efficient, Scalable RL Training Framework for LLMs that Interleave Reasoning and Search Engine Calling, based on veRL
My learning notes for ML SYS.
LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath
Scalable toolkit for efficient model reinforcement
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments
Training Large Language Model to Reason in a Continuous Latent Space
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
This repo relates to the survey paper "Goal-Conditioned Reinforcement Learning: Problems and Solutions". We collect widely used benchmark environments and summarize a series of research works for g…