RLHF-V

RLHF-V

21 followers · 0 following

Achievements

Stars

OpenBMB / RLPR

Extrapolating RLVR to General Domains without Verifiers

Python 203 12 Updated Aug 12, 2025

BytedTsinghua-SIA / DAPO

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,828 83 Updated May 11, 2025

RL4VLM / RL4VLM

Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Jupyter Notebook 412 39 Updated Dec 15, 2024

OpenBMB / Eurus

Python 323 15 Updated Sep 18, 2024

LeapLabTHU / MLLA

[NeurIPS 2024] Official repository of MLLA

Python 375 17 Updated Jul 11, 2025

thunlp / Muffin

Python 65 3 Updated Feb 5, 2024

RLHF-V / RLAIF-V

[CVPR'25 highlight] RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness

Python 455 20 Updated May 14, 2025

OpenBMB / MiniCPM-V

A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone

Python 25,641 2,006 Updated Jun 4, 2026

flyskywhy / react-native-font-sim

React Native font SimSun <宋体> SimHei <黑体> KaiTi<楷体> , support iOS and Android both.

7 2 Updated Mar 15, 2023

dtde / simhei

1 Updated Apr 15, 2022

thuservices / thuservices

https://thu.services

JavaScript 438 58 Updated May 29, 2026

RLHF-V / RLHF-V

[CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

Python 309 9 Updated Sep 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RLHF-V

Achievements

Achievements

Block or report RLHF-V

Stars

OpenBMB / RLPR

BytedTsinghua-SIA / DAPO

RL4VLM / RL4VLM

OpenBMB / Eurus

LeapLabTHU / MLLA

thunlp / Muffin

RLHF-V / RLAIF-V

OpenBMB / MiniCPM-V

flyskywhy / react-native-font-sim

dtde / simhei

thuservices / thuservices

RLHF-V / RLHF-V