0russwest0

russwest404 0russwest0

52 followers · 4 following

Achievements

x2 x3

Achievements

x2 x3

Highlights

Stars

40 stars written in Python

Clear filter

huggingface / transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 158,605 32,700 Updated Mar 31, 2026

langflow-ai / langflow

Langflow is a powerful tool for building and deploying AI-powered agents and workflows.

Python 146,437 8,680 Updated Mar 31, 2026

langchain-ai / langchain

The agent engineering platform

Python 131,813 21,731 Updated Mar 31, 2026

sgl-project / sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,292 5,091 Updated Mar 31, 2026

verl-project / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,356 3,544 Updated Mar 31, 2026

microsoft / agent-lightning

The absolute trainer to light up AI agents.

Python 16,201 1,397 Updated Feb 28, 2026

QwenLM / Qwen-Agent

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 15,853 1,523 Updated Mar 4, 2026

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...)…

Python 13,452 1,311 Updated Mar 31, 2026

OpenPipe / ART

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.5, GPT-OSS, Llama, and more!

Python 9,114 783 Updated Mar 31, 2026

open-compass / opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 6,817 753 Updated Mar 30, 2026

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

Python 5,330 530 Updated Mar 31, 2026

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 5,053 677 Updated Mar 29, 2026

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,790 363 Updated Mar 26, 2026

RUCAIBox / RecBole

A unified, comprehensive and efficient recommendation library

Python 4,356 732 Updated Feb 24, 2025

SwanHubX / SwanLab

⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / verl / LLaMA Factory / ms-swift / U…

Python 3,757 193 Updated Mar 31, 2026

CheshireCC / faster-whisper-GUI

faster_whisper GUI with PySide6

Python 2,919 168 Updated Dec 8, 2024

mll-lab-nu / RAGEN

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,576 213 Updated Mar 28, 2026

ading2210 / poe-api

[UNMAINTAINED] A reverse engineered Python API wrapper for Quora's Poe, which provides free access to ChatGPT, GPT-4, and Claude.

Python 2,491 307 Updated Sep 18, 2023

bytedance / pasa

PaSa -- an advanced paper search agent powered by large language models. It can autonomously make a series of decisions, including invoking search tools, reading papers, and selecting relevant refe…

Python 1,548 109 Updated May 27, 2025

AgentR1 / Agent-R1

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Python 1,324 86 Updated Mar 30, 2026

google-deepmind / android_env

RL research on Android devices.

Python 1,198 105 Updated Feb 26, 2026

bytedance / SandboxFusion

Python 967 93 Updated Dec 11, 2025

NVIDIA-NeMo / Skills

A project to improve skills of large language models

Python 905 169 Updated Mar 31, 2026

Melmaphother / Science-Star

Science-Star: A Platform for Building, Extending, and Experimenting with Scientific Agents.

Python 748 53 Updated Mar 4, 2026

google-research / android_world

AndroidWorld is an environment and benchmark for autonomous agents

Python 696 144 Updated Mar 25, 2026

agentscope-ai / Trinity-RFT

Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (LLM).

Python 581 59 Updated Mar 31, 2026

axon-rl / gem

A Gym for Agentic LLMs

Python 472 31 Updated Jan 21, 2026

mll-lab-nu / VAGEN

Training VLM agents with multi-turn reinforcement learning

Python 437 52 Updated Mar 25, 2026

EvolvingLMMs-Lab / multimodal-search-r1

MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.

Python 414 22 Updated Aug 26, 2025

DigiRL-agent / digirl

Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.

Python 394 35 Updated Feb 22, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

russwest404 0russwest0

Achievements

Achievements

Highlights

Block or report 0russwest0

Stars

huggingface / transformers

langflow-ai / langflow

langchain-ai / langchain

sgl-project / sglang

verl-project / verl

microsoft / agent-lightning

QwenLM / Qwen-Agent

modelscope / ms-swift

OpenPipe / ART

open-compass / opencompass

rllm-org / rllm

THUDM / slime

hiyouga / EasyR1

RUCAIBox / RecBole

SwanHubX / SwanLab

CheshireCC / faster-whisper-GUI

mll-lab-nu / RAGEN

ading2210 / poe-api

bytedance / pasa

AgentR1 / Agent-R1

google-deepmind / android_env

bytedance / SandboxFusion

NVIDIA-NeMo / Skills

Melmaphother / Science-Star

google-research / android_world

agentscope-ai / Trinity-RFT

axon-rl / gem

mll-lab-nu / VAGEN

EvolvingLMMs-Lab / multimodal-search-r1

DigiRL-agent / digirl