verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,309 117 Updated Dec 11, 2025

WangRongsheng / awesome-LLM-resources

🧑‍🚀 全世界最好的LLM资料总结（多模态生成、Agent、辅助编程、AI审稿、数据处理、模型训练、模型推理、o1 模型、MCP、小语言模型、视觉语言模型） | Summary of the world's best LLM resources.

7,028 682 Updated Dec 18, 2025

Simple-Efficient / RL-Factory

Train your Agent model via our easy and efficient framework

Python 1,668 156 Updated Dec 5, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,731 2,877 Updated Dec 23, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & TIS & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,647 839 Updated Dec 18, 2025

jingtian11 / EasyOffer

《EasyOffer》（<大模型面经合集>）是针对LLM宝宝们量身打造的大模型暑期实习Offer指南，主要记录大模型暑期实习和秋招准备的一些常见大厂手撕代码、大厂面经经验、常见大厂思考题等；小白一个，正在学习ing......有问题各位大佬随时指正，希望大家都能拿到心仪Offer！

Jupyter Notebook 602 46 Updated Mar 25, 2025

yuzhaouoe / SAE-based-representation-engineering

[NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering

Python 68 6 Updated Nov 25, 2024

zixian2021 / AI-interview-cards

最完整的AI算法面试题目仓库，1000道，25个类目

1,313 114 Updated Aug 13, 2023

calubkk / RAAT

[ACL-2024]Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training

Python 40 3 Updated Oct 28, 2024

Lordog / dive-into-llms

《动手学大模型Dive into LLMs》系列编程实践教程

Jupyter Notebook 11,241 1,251 Updated Oct 10, 2025

PacktPublishing / Mastering-Transformers

Mastering Transformers, published by Packt

Jupyter Notebook 358 150 Updated Dec 15, 2025

zjunlp / KnowUnDo

[EMNLP 2024] To Forget or Not? Towards Practical Knowledge Unlearning for Large Language Models

Python 47 1 Updated Jan 23, 2025

II-Bench / II-Bench

Python 27 3 Updated Oct 28, 2024

voidism / DoLa

Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"

Python 532 66 Updated Jan 17, 2025

shmsw25 / FActScore

A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"

Python 413 61 Updated Apr 13, 2025

HillZhang1999 / llm-hallucination-survey

Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models"

1,068 54 Updated Sep 27, 2025

ydyjya / Awesome-LLM-Safety

A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights i…

HTML 1,723 87 Updated Dec 19, 2025

ictnlp / TruthX

Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"

Python 144 7 Updated Mar 26, 2024

HITsz-TMG / Ext-Sub

Official implementation of our paper "Separate the Wheat from the Chaff: Model Deficiency Unlearning via Parameter-Efficient Module Operation". A model merge method for deficiency unlearning, compi…

Python 11 2 Updated Sep 20, 2024

HillZhang1999 / ICD

Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"

Python 69 10 Updated Feb 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dingwei Chen CuSO4-Chen

Block or report CuSO4-Chen

Starred repositories

RUC-NLPIR / ARPO

yangzhch6 / TreeRPO

S1s-Z / CANOE

S1s-Z / NOVA

S1s-Z / GATEAU

CuSO4-Chen / PLI

xhyumiracle / Awesome-AgenticLLM-RL-Papers

0russwest0 / Agent-R1

rllm-org / rllm

0russwest0 / Awesome-Agent-RL

langfengQ / verl-agent