0russwest0

russwest404 0russwest0

46 followers · 3 following

Achievements

x2 x3

Achievements

x2 x3

Stars

microsoft / agent-lightning

The absolute trainer to light up AI agents.

Python 9,792 791 Updated Dec 22, 2025

Melmaphother / Science-Star

Science-Star: A Platform for Building, Extending, and Experimenting with Scientific Agents.

Python 739 51 Updated Oct 9, 2025

OpenPipe / ART

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 8,059 643 Updated Dec 22, 2025

DigiRL-agent / digirl

Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.

Python 385 34 Updated Feb 22, 2025

0russwest0 / Agent-R1

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Python 1,051 75 Updated Nov 25, 2025

google-research / android_world

AndroidWorld is an environment and benchmark for autonomous agents

Python 548 112 Updated Nov 24, 2025

HW-whistleblower / True-Story-of-Pangu

诺亚盘古大模型研发背后的真正的心酸与黑暗的故事。

11,366 1,348 Updated Jul 9, 2025

X-LANCE / Mobile-Env

A Universal Platform for Training and Evaluation of Mobile Interaction

Python 57 6 Updated Sep 24, 2025

google-deepmind / android_env

RL research on Android devices.

Python 1,165 101 Updated Dec 16, 2025

YuxiangChai / A3

Python 35 4 Updated Jul 7, 2025

bytedance / SandboxFusion

Python 836 75 Updated Dec 11, 2025

YifeiZhou02 / ArCHer

Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"

Python 199 18 Updated Apr 17, 2025

YunjiaXi / InfoDeepSeek

Code for InfoDeepSeek: Benchmarking Agentic Information Seeking for Retrieval-Augmented Generation

Python 18 2 Updated May 29, 2025

modelscope / Trinity-RFT

Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (LLM).

Python 449 45 Updated Dec 22, 2025

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 11,778 1,075 Updated Dec 22, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 21,892 3,832 Updated Dec 22, 2025

NVIDIA-NeMo / Skills

A project to improve skills of large language models

Python 710 132 Updated Dec 22, 2025

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,293 328 Updated Dec 15, 2025

EvolvingLMMs-Lab / multimodal-search-r1

MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.

Python 368 18 Updated Aug 26, 2025

QwenLM / Qwen-Agent

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 12,742 1,176 Updated Sep 26, 2025

mll-lab-nu / VAGEN

Training VLM agents with multi-turn reinforcement learning

Python 349 42 Updated Dec 1, 2025

Sun-Haoyuan23 / Awesome-RL-based-Reasoning-MLLMs

This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!

1,311 58 Updated Dec 7, 2025

0russwest0 / Awesome-Agent-RL

452 18 Updated Oct 11, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,699 2,866 Updated Dec 22, 2025

mll-lab-nu / RAGEN

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Jupyter Notebook 2,447 194 Updated Dec 3, 2025

bytedance / pasa

PaSa -- an advanced paper search agent powered by large language models. It can autonomously make a series of decisions, including invoking search tools, reading papers, and selecting relevant refe…

Python 1,460 112 Updated May 27, 2025

facebookresearch / CRAG

Comprehensive benchmark for RAG

Jupyter Notebook 249 30 Updated Jun 14, 2025

langflow-ai / langflow

Langflow is a powerful tool for building and deploying AI-powered agents and workflows.

Python 141,721 8,199 Updated Dec 22, 2025

hyintell / awesome-refreshing-llms

EMNLP'23 survey: a curation of awesome papers and resources on refreshing large language models (LLMs) without expensive retraining.

136 11 Updated Dec 12, 2023

USTCAGI / CRAG-in-KDD-Cup2024

Python 59 6 Updated Jan 19, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

russwest404 0russwest0

Achievements

Achievements

Block or report 0russwest0

Stars

microsoft / agent-lightning

Melmaphother / Science-Star

OpenPipe / ART

DigiRL-agent / digirl

0russwest0 / Agent-R1

google-research / android_world

HW-whistleblower / True-Story-of-Pangu

X-LANCE / Mobile-Env

google-deepmind / android_env

YuxiangChai / A3

bytedance / SandboxFusion

YifeiZhou02 / ArCHer

YunjiaXi / InfoDeepSeek

modelscope / Trinity-RFT

modelscope / ms-swift

sgl-project / sglang

NVIDIA-NeMo / Skills

hiyouga / EasyR1

EvolvingLMMs-Lab / multimodal-search-r1

QwenLM / Qwen-Agent

mll-lab-nu / VAGEN

Sun-Haoyuan23 / Awesome-RL-based-Reasoning-MLLMs

0russwest0 / Awesome-Agent-RL

volcengine / verl

mll-lab-nu / RAGEN

bytedance / pasa

facebookresearch / CRAG

langflow-ai / langflow

hyintell / awesome-refreshing-llms

USTCAGI / CRAG-in-KDD-Cup2024