verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training".

Python 2,020 197 Updated Jun 9, 2026

hiyouga / LlamaFactory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 72,257 8,843 Updated Jun 17, 2026

wenhwu / awesome-remote-sensing-change-detection

A comprehensive and up-to-date compilation of datasets, tools, methods, review papers, and competitions for remote sensing change detection.

2,250 403 Updated Apr 16, 2026

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by the Qwen team, Alibaba Cloud.

Jupyter Notebook 19,409 1,791 Updated Jan 30, 2026

zli12321 / Vision-SR1

Reinforcement Learning of Vision Language Models with Self Visual Perception Reward

Python 174 17 Updated Mar 14, 2026

multimodal-art-projection / REER_DeepWriter

Forked from HaozheH3/REER_DeepWriter

REverse-Engineered Reasoning for Open-Ended Generation

Python 97 7 Updated Sep 10, 2025

Tang Xi ncTimTang

Stars

SkalskiP / top-cvpr-2026-papers

ncTimTang / tangxi.github.io

sotayang / Awesome-Streaming-Video-Understanding

EvolvingLMMs-Lab / SimpleStream

mit-han-lab / streaming-vlm

MME-Benchmarks / Video-MME-v2

qiujihao19 / LongVideo-R1

MzeroMiko / MzeroMiko.github.io

MzeroMiko / vHeat

MzeroMiko / XDLM

MzeroMiko / LLaDA-XDLM

feufhd / VideoAnchor

mingrui-wu / OSI-Bench

InternLM / CapRL

langfengQ / verl-agent

hiyouga / LlamaFactory

wenhwu / awesome-remote-sensing-change-detection

QwenLM / Qwen3-VL

zli12321 / Vision-SR1

multimodal-art-projection / REER_DeepWriter

ncTimTang / previous_tangxi.github.io

Dao-AILab / flash-attention

hiyouga / EasyR1

Open-Reasoner-Zero / Open-Reasoner-Zero

om-ai-lab / VLM-R1

Sun-Haoyuan23 / Awesome-RL-based-Reasoning-MLLMs

verl-project / verl

martian422 / ReDDiT

callsys / GMPO

AZZMM / CC-Diff