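**DeepL with no API key and no service to start**: double-click to use, free with unlimited usage (**new: DeepL word lookup**). A bobplugin developed by reverse-engineering the web version's JavaScript encryption algorithm, so as long as the official site's algorithm does not change, it can in theory be used without limit. (Major update! As a thank-you to long-time users, it has now been optimized so that free translation keeps working even after frequent requests!) **apiKey is not required, no account password required**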

JavaScript 612 41 Updated Aug 30, 2024

TsinghuaC3I / Awesome-RL-for-LRMs

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,420 129 Updated Nov 9, 2025

youngyangyang04 / leetcode-master

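《代码随想录》 LeetCode problem-solving guide: a recommended order for working through 200 classic problems, 600,000 characters of detailed illustrated explanations, video walkthroughs of the difficult points, and more than 50 mind maps, with versions in C++, Java, Python, Go, JavaScript, and other languages. Algorithm study no longer has to be confusing! 🔥🔥 Take a look and you will wish you had found it sooner! 🚀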

Shell 61,042 12,315 Updated Mar 11, 2026

browser-use / browser-use

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 87,071 10,038 Updated Apr 10, 2026

xuw / llm_course_public

Python 24 20 Updated Oct 12, 2025

zzli2022 / Awesome-System2-Reasoning-LLM

Latest Advances on System-2 Reasoning

Python 1,342 77 Updated Jun 8, 2025

yfzhang114 / Awesome-Multimodal-Large-Language-Models

Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models

1,068 41 Updated Mar 15, 2026

Open-Reasoner-Zero / Open-Reasoner-Zero

Official Repo for Open-Reasoner-Zero

Python 2,089 119 Updated Jun 2, 2025

ray-project / ray-educational-materials

This is a suite of hands-on training materials that show how to scale CV, NLP, and time-series forecasting workloads with Ray.

Jupyter Notebook 457 82 Updated Feb 13, 2024

Unakar / Logic-RL

Reproduce R1 Zero on Logic Puzzle

Python 2,446 164 Updated Mar 20, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,336 915 Updated Apr 11, 2026

hkust-nlp / simpleRL-reason

Simple RL training for reasoning

Python 3,846 289 Updated Dec 23, 2025

QZH-777 / longrag

3 Updated Jan 24, 2025

yafuly / MAGE

Machine-generated text detection in the wild (ACL 2024)

Python 224 14 Updated Mar 6, 2025

LyWangPX / Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutions

Solutions of Reinforcement Learning, An Introduction

Jupyter Notebook 2,397 510 Updated Jul 10, 2025

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…

Python 13,631 1,339 Updated Apr 10, 2026