Weixiang Yan WeixiangYAN

41 followers · 22 following

Palo Alto, CA
weixiangyan.github.io

Stars

newmarcel / KeepingYouAwake

Prevents your Mac from going to sleep.

Objective-C 6,115 243 Updated Oct 30, 2025

xhyumiracle / Awesome-AgenticLLM-RL-Papers

1,353 60 Updated Sep 5, 2025

browseros-ai / BrowserOS

🌐 The open-source Agentic browser; privacy-first alternative to ChatGPT Atlas, Perplexity Comet, Dia.

C++ 8,325 805 Updated Dec 24, 2025

NovaSky-AI / SkyRL

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,398 206 Updated Dec 24, 2025

langfengQ / verl-agent

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,314 117 Updated Dec 11, 2025

PrimeIntellect-ai / verifiers

Our library for RL environments + evals

Python 3,661 455 Updated Dec 24, 2025

meta-pytorch / OpenEnv

An interface library for RL post training with environments.

Python 859 138 Updated Dec 24, 2025

luogu-dev / cyaron

CYaRon: Yet Another Random Olympic-iNformatics test data generator

Python 1,599 177 Updated Oct 26, 2025

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

Python 4,904 469 Updated Dec 24, 2025

anthropics / claude-cookbooks

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Jupyter Notebook 29,935 3,041 Updated Dec 23, 2025

anthropics / skills

Public repository for Agent Skills

Python 26,517 2,445 Updated Dec 20, 2025

THUDM / AgentRL

Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework

Python 159 9 Updated Dec 16, 2025

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,722 1,359 Updated Dec 24, 2025

anthropics / prompt-eng-interactive-tutorial

Anthropic's Interactive Prompt Engineering Tutorial

Jupyter Notebook 27,571 2,617 Updated Jul 11, 2024

anthropics / claude-code

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Shell 48,389 3,404 Updated Dec 20, 2025

wshobson / agents

Intelligent automation and multi-agent orchestration for Claude Code

Python 23,386 2,592 Updated Dec 23, 2025

NVIDIA-NeMo / Skills

A project to improve skills of large language models

Python 720 133 Updated Dec 24, 2025

axon-rl / gem

A Gym for Agentic LLMs

Python 409 27 Updated Dec 23, 2025

TIGER-AI-Lab / verl-tool

A version of verl to support diverse tool use

Python 780 64 Updated Dec 24, 2025

open-thought / reasoning-gym

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,284 106 Updated Dec 15, 2025

lobehub / lobe-chat

🤯 LobeHub - an open-source, modern design AI Agent Workspace. Supports multiple AI providers, Knowledge Base (file upload / RAG ), one click install MCP Marketplace and Artifacts / Thinking. One-cl…

TypeScript 69,410 14,287 Updated Dec 24, 2025

agent-infra / sandbox

All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container.

Python 1,790 152 Updated Dec 16, 2025

The open source developer platform to build AI agents and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integrated platform.

Python 23,432 5,098 Updated Dec 24, 2025

huggingface / smol2operator

Python 126 18 Updated Sep 23, 2025

Cranot / claude-code-guide

Claude Code Comprehensive Guide

2,219 245 Updated Nov 6, 2025

netblue30 / firejail

Linux namespaces and seccomp-bpf sandbox

C 6,869 636 Updated Dec 23, 2025

R2E-Gym / R2E-Gym

[COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents

Python 215 41 Updated Jul 13, 2025

shareAI-lab / Kode-cli

not another coding agent, kode is agent cli for everything

TypeScript 3,833 594 Updated Dec 15, 2025

Yvictor / TradingGym

Trading and Backtesting environment for training reinforcement learning agent or simple rule base algo.

Python 1,777 357 Updated Feb 11, 2024
