-
Tsinghua University
- Tsinghua University
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
OpenClaw-RL: Train any agent simply by talking
This repository hosts a collection of datasets for training and evaluating CUA / GUI agents.
An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone
All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container.
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
**DeepL免秘钥,免启服务**,双击使用,免费无限次使用,(**新增DeepL单词查询功能**)根据网页版JavaScript加密算法逆向开发的bobplugin;所以只要官网的算法不改,理论上就可以无限使用;(重大更新!!!回馈老用户,现已优化,频繁访问后仍然可以继续免费翻译!!) **apiKey is not required,No account password required**
A Survey of Reinforcement Learning for Large Reasoning Models
《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
Latest Advances on System-2 Reasoning
Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models
Official Repo for Open-Reasoner-Zero
This is suite of the hands-on training materials that shows how to scale CV, NLP, time-series forecasting workloads with Ray.
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)
Machine-generated text detection in the wild (ACL 2024)
Solutions of Reinforcement Learning, An Introduction
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…
Scalable RL solution for advanced reasoning of language models
Building Open LLM Web Agents with Self-Evolving Online Curriculum RL
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Towards Large Multimodal Models as Visual Foundation Agents