Stars
High-performance distributed data shuffling (all-to-all) library for MoE training and inference
[ICLR 2026] Agentic Reinforced Policy Optimization (ARPO)
🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)