👳♂️
I think it’s a new feature. Don’t tell anyone it was an accident.
-
Global Knowledge
- Jakarta, Indonesia
- indepeo.dev
Highlights
Popular repositories Loading
-
rwkv-reward-enhanced
rwkv-reward-enhanced PublicThis repository contains an enhanced reward model training procedure using RWKV for RLHF. It's a work in progress with a focus on generating diverse trajectories and high-quality answers.
Python 3
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.