A junior in the Department of Computer Science and Technology at Tsinghua University.
-
Tsinghua University
- Beijing
-
03:06
(UTC -04:00) - https://racktic.github.io/
- https://scholar.google.com/citations?user=AvbV0HcAAAAJ&hl=en&oi=ao
Highlights
- Pro
Pinned Loading
-
RLHF-V/RLAIF-V
RLHF-V/RLAIF-V Public[CVPR'25 highlight] RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness
-
hiyouga/EasyR1
hiyouga/EasyR1 PublicEasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
-
verl-project/verl
verl-project/verl Publicverl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.