Second Year PhD Candidate of Tsinghua University @thunlp
-
Tsinghua University
- Beijing, China
-
19:28
(UTC +08:00) - https://hbx-hbx.github.io/
- @hbx_hbx
Highlights
- Pro
Stars
3
stars
written in TeX
Clear filter
A Survey of Reinforcement Learning for Large Reasoning Models
A bibliography and survey of the papers surrounding o1