- Stanford University
- Palo Alto, CA
- jiachengmiao.com
- @Jiacheng_Miao
Stars
LLM-powered personalized daily lists of scientific papers
A set of examples based on verl for end-to-end RL training recipes.
Example apps for the Apps SDK
High accuracy RAG for answering questions from scientific documents with citations
Convert documentation websites, GitHub repositories, and PDFs into Claude AI skills with automatic conflict detection
A set of beautifully-designed, accessible components and a code distribution platform. Works with your favorite frameworks. Open Source. Open Code.
Open-source platform to build and deploy AI agent workflows.
A Claude Code plugin that automatically captures everything Claude does during your coding sessions, compresses it with AI (using Claude's agent-sdk), and injects relevant context back into future …
The contents of /home/oai/skills in ChatGPT's code interpreter environment
[Preprint] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
Accompanying code for "Discovering State-of-the-art Reinforcement Algorithms" Nature publication
slime is an LLM post-training framework for RL Scaling.
Mirix is a multi-agent personal assistant designed to track on-screen activities and answer user questions intelligently. By capturing real-time visual data and consolidating it into structured mem…
A benchmark for LLMs on complicated tasks in the terminal
Run Claude Agent (Claude Code) in a sandbox, control it via websocket
Tongyi Deep Research, the Leading Open-source Deep Research Agent
The most open diffusion language model for code generation — releasing pretraining, evaluation, inference, and checkpoints.
Public repo for rbio, a biologically-informed reasoning model trained on virtual cell models as verifiers
[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild
Use Claude Code as the foundation for coding infrastructure, allowing you to decide how to interact with the model while enjoying updates from Anthropic.