-
Nanjing University
- Nanjing, China
- wengrx.github.io
Lists (9)
Sort Name ascending (A-Z)
Starred repositories
The agent that grows with you
Lightweight, open-source AI agent for your tools, chats, and workflows.
[TPAMI 2026] Discrete Tokenization for Multimodal LLMs: A Comprehensive Survey.
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.6, GPT-OSS, Llama, and more!
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Book_5_《统计至简》 | 鸢尾花书:从加减乘除到机器学习;上架!
Pioneering Automated GUI Interaction with Native Agents
A curated collection of LLM reasoning and planning resources, including key papers, limitations, benchmarks, and additional learning materials.
Revisiting Mid-training in the Era of Reinforcement Learning Scaling
Implementing DeepSeek R1's GRPO algorithm from scratch
Agent2Agent (A2A) is an open protocol enabling communication and interoperability between opaque agentic applications.
🐉 Loong: Synthesize Long CoTs at Scale through Verifiers.
An Open-source RL System from ByteDance Seed and Tsinghua AIR
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
Wan: Open and Advanced Large-Scale Video Generative Models
An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to inference, as well as some practical experiences and conclusions.…
Official Repo for Open-Reasoner-Zero
Minimal reproduction of DeepSeek R1-Zero
Fully open reproduction of DeepSeek-R1
The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention
Scalable RL solution for advanced reasoning of language models
A series of technical report on Slow Thinking with LLM