RLHFlow
RLHFlow
Code for the Workflow of Reinforcement Learning from Human Feedback (RLHF)
United States of America
數心
Matheart
CS PhD at UPenn. Previously HKUST CS+MATH. Deep Learning Theory. Content Creator @bilibili.
University of Pennsylvania Philadelphia, PA
Youcheng Li
xjtulyc
Machine Learning, Medical Imaging, Spatial Transcriptome
Peking University Peking
Fanpeng MENG
mfp0610
M.Phil. Student at CUHKSZ @GAP-LAB-CUHK-SZ | B.Eng. at HuazhongUST
@CUHKSZ Shenzhen, China
Junhua Liu
junhua-l
PhD@USC. VR/AR; AI Codec; HCI; ML for networking. Prev: CMU, Harvard, Sensetime
Shenzhen
Hanshi Sun
preminstrel
Research Scientist @ByteDance-Seed; MS@CMU; BS @seu; MLSys
CMU -> ByteDance Seed Bellevue
Yang (Thomas) Li
thomas-young-2013
Senior researcher, focusing on GenAI algorithms, frameworks/systems, and applications.
Tencent Inc. Peking University