- Palo Alto, CA
- weixiangyan.github.io
Stars
Prevents your Mac from going to sleep.
🌐 The open-source Agentic browser; privacy-first alternative to ChatGPT Atlas, Perplexity Comet, Dia.
SkyRL: A Modular Full-stack RL Library for LLMs
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"
Our library for RL environments + evals
An interface library for RL post training with environments.
CYaRon: Yet Another Random Olympic-iNformatics test data generator
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Anthropic's Interactive Prompt Engineering Tutorial
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
Intelligent automation and multi-agent orchestration for Claude Code
A project to improve skills of large language models
A version of verl to support diverse tool use
[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards
🤯 LobeHub - an open-source, modern design AI Agent Workspace. Supports multiple AI providers, Knowledge Base (file upload / RAG ), one click install MCP Marketplace and Artifacts / Thinking. One-cl…
All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container.
The open source developer platform to build AI agents and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integrated platform.
[COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents
not another coding agent, kode is agent cli for everything
Trading and Backtesting environment for training reinforcement learning agent or simple rule base algo.