Skip to content
View 0russwest0's full-sized avatar

Block or report 0russwest0

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The absolute trainer to light up AI agents.

Python 9,792 791 Updated Dec 22, 2025

Science-Star: A Platform for Building, Extending, and Experimenting with Scientific Agents.

Python 739 51 Updated Oct 9, 2025

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 8,059 643 Updated Dec 22, 2025

Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.

Python 385 34 Updated Feb 22, 2025

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Python 1,051 75 Updated Nov 25, 2025

AndroidWorld is an environment and benchmark for autonomous agents

Python 548 112 Updated Nov 24, 2025

诺亚盘古大模型研发背后的真正的心酸与黑暗的故事。

11,366 1,348 Updated Jul 9, 2025

A Universal Platform for Training and Evaluation of Mobile Interaction

Python 57 6 Updated Sep 24, 2025

RL research on Android devices.

Python 1,165 101 Updated Dec 16, 2025
Python 35 4 Updated Jul 7, 2025
Python 836 75 Updated Dec 11, 2025

Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"

Python 199 18 Updated Apr 17, 2025

Code for InfoDeepSeek: Benchmarking Agentic Information Seeking for Retrieval-Augmented Generation

Python 18 2 Updated May 29, 2025

Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (LLM).

Python 449 45 Updated Dec 22, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 11,778 1,075 Updated Dec 22, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 21,892 3,832 Updated Dec 22, 2025

A project to improve skills of large language models

Python 710 132 Updated Dec 22, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,293 328 Updated Dec 15, 2025

MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.

Python 368 18 Updated Aug 26, 2025

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 12,742 1,176 Updated Sep 26, 2025

Training VLM agents with multi-turn reinforcement learning

Python 349 42 Updated Dec 1, 2025

This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!

1,311 58 Updated Dec 7, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,699 2,866 Updated Dec 22, 2025

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Jupyter Notebook 2,447 194 Updated Dec 3, 2025

PaSa -- an advanced paper search agent powered by large language models. It can autonomously make a series of decisions, including invoking search tools, reading papers, and selecting relevant refe…

Python 1,460 112 Updated May 27, 2025

Comprehensive benchmark for RAG

Jupyter Notebook 249 30 Updated Jun 14, 2025

Langflow is a powerful tool for building and deploying AI-powered agents and workflows.

Python 141,721 8,199 Updated Dec 22, 2025

EMNLP'23 survey: a curation of awesome papers and resources on refreshing large language models (LLMs) without expensive retraining.

136 11 Updated Dec 12, 2023
Python 59 6 Updated Jan 19, 2025
Next