Skip to content
View RLHF-V's full-sized avatar

Block or report RLHF-V

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Extrapolating RLVR to General Domains without Verifiers

Python 203 12 Updated Aug 12, 2025

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,828 83 Updated May 11, 2025

Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Jupyter Notebook 412 39 Updated Dec 15, 2024
Python 323 15 Updated Sep 18, 2024

[NeurIPS 2024] Official repository of MLLA

Python 375 17 Updated Jul 11, 2025
Python 65 3 Updated Feb 5, 2024

[CVPR'25 highlight] RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness

Python 455 20 Updated May 14, 2025

A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone

Python 25,641 2,006 Updated Jun 4, 2026

React Native font SimSun <宋体> SimHei <黑体> KaiTi<楷体> , support iOS and Android both.

7 2 Updated Mar 15, 2023
1 Updated Apr 15, 2022

https://thu.services

JavaScript 438 58 Updated May 29, 2026

[CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

Python 309 9 Updated Sep 11, 2024