- Shanghai Jiao Tong University
- Shanghai
Stars
verl: Volcano Engine Reinforcement Learning for LLMs
🤗 smolagents: a barebones library for agents that think in code.
Latent Collaboration in Multi-Agent Systems
A beautiful, simple, clean, and responsive Jekyll theme for academics
A programmer's guide to cooking at home (Simplified Chinese only).
The official implementation of the paper "Mem-α: Learning Memory Construction via Reinforcement Learning"
AlphaGo Moment for Model Architecture Discovery.
FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI…
Open-source code for the paper "Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions"
Evolve your language agent with Agentic Context Engineering (ACE)
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Code for the paper "BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping" by Zhiheng Xi et al.
Official Repository of Absolute Zero Reasoner
Generative AI Act II: Test Time Scaling Drives Cognition Engineering
Compacts ever-growing WSL vhdx images.
Rust version of THU uCore OS. Linux compatible.