bordeauxred

5 followers · 3 following

Stars

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 38,981 4,933 Updated Dec 9, 2025

facebookresearch / meta-agents-research-environments

Meta Agents Research Environments is a comprehensive platform designed to evaluate AI agents in dynamic, realistic scenarios. Unlike static benchmarks, this platform introduces evolving environment…

Python 403 55 Updated Nov 17, 2025

ComposioHQ / lovable-for-ai-agents

TypeScript 41 16 Updated Aug 13, 2025

langfengQ / verl-agent

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training".

Python 1,297 116 Updated Dec 11, 2025

agno-agi / agno

The unified stack for multi-agent systems.

Python 36,181 4,785 Updated Dec 21, 2025

iliane5 / meridian

Meridian cuts through news noise by scraping hundreds of sources, analyzing stories with AI, and delivering concise, personalized daily briefs.

TypeScript 2,364 444 Updated May 30, 2025

PrimeIntellect-ai / verifiers

Our library for RL environments + evals

Python 3,653 453 Updated Dec 21, 2025

Jiayi-Pan / TinyZero

Minimal reproduction of DeepSeek R1-Zero

Python 12,503 1,530 Updated Apr 24, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,661 2,860 Updated Dec 21, 2025

PeterGriffinJin / Search-R1

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,679 309 Updated Nov 13, 2025

shreyaskarnik / huggingface-mcp-server

Python 67 12 Updated Mar 19, 2025

mll-lab-nu / RAGEN

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Jupyter Notebook 2,446 194 Updated Dec 3, 2025

Skyvern-AI / skyvern

Automate browser based workflows with AI

Python 19,850 1,734 Updated Dec 21, 2025

thu-ml / tianshou

An elegant PyTorch deep reinforcement learning library.

Python 9,004 1,200 Updated Dec 1, 2025

aa14k / Exploration-in-RL

Jupyter Notebook 29 13 Updated May 27, 2024

rlworkgroup / garage

A toolkit for reproducible reinforcement learning research.

Python 2,057 321 Updated May 4, 2023

Akella17 / Deep-Bayesian-Quadrature-Policy-Optimization

Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization.

Python 17 7 Updated Feb 17, 2021