Skip to content
View 0russwest0's full-sized avatar

Highlights

  • Pro

Block or report 0russwest0

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
40 stars written in Python
Clear filter

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 158,605 32,700 Updated Mar 31, 2026

Langflow is a powerful tool for building and deploying AI-powered agents and workflows.

Python 146,437 8,680 Updated Mar 31, 2026

The agent engineering platform

Python 131,813 21,731 Updated Mar 31, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,292 5,091 Updated Mar 31, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,356 3,544 Updated Mar 31, 2026

The absolute trainer to light up AI agents.

Python 16,201 1,397 Updated Feb 28, 2026

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 15,853 1,523 Updated Mar 4, 2026

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…

Python 13,452 1,311 Updated Mar 31, 2026

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.5, GPT-OSS, Llama, and more!

Python 9,114 783 Updated Mar 31, 2026

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 6,817 753 Updated Mar 30, 2026

Democratizing Reinforcement Learning for LLMs

Python 5,330 530 Updated Mar 31, 2026

slime is an LLM post-training framework for RL Scaling.

Python 5,053 677 Updated Mar 29, 2026

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,790 363 Updated Mar 26, 2026

A unified, comprehensive and efficient recommendation library

Python 4,356 732 Updated Feb 24, 2025

⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / verl / LLaMA Factory / ms-swift / U…

Python 3,757 193 Updated Mar 31, 2026

faster_whisper GUI with PySide6

Python 2,919 168 Updated Dec 8, 2024

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,576 213 Updated Mar 28, 2026

[UNMAINTAINED] A reverse engineered Python API wrapper for Quora's Poe, which provides free access to ChatGPT, GPT-4, and Claude.

Python 2,491 307 Updated Sep 18, 2023

PaSa -- an advanced paper search agent powered by large language models. It can autonomously make a series of decisions, including invoking search tools, reading papers, and selecting relevant refe…

Python 1,548 109 Updated May 27, 2025

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Python 1,324 86 Updated Mar 30, 2026

RL research on Android devices.

Python 1,198 105 Updated Feb 26, 2026
Python 967 93 Updated Dec 11, 2025

A project to improve skills of large language models

Python 905 169 Updated Mar 31, 2026

Science-Star: A Platform for Building, Extending, and Experimenting with Scientific Agents.

Python 748 53 Updated Mar 4, 2026

AndroidWorld is an environment and benchmark for autonomous agents

Python 696 144 Updated Mar 25, 2026

Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (LLM).

Python 581 59 Updated Mar 31, 2026

A Gym for Agentic LLMs

Python 472 31 Updated Jan 21, 2026

Training VLM agents with multi-turn reinforcement learning

Python 437 52 Updated Mar 25, 2026

MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.

Python 414 22 Updated Aug 26, 2025

Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.

Python 394 35 Updated Feb 22, 2025
Next