Deep Thinking
PhD Student | Reasoning with LLMs
- Shenzhen, China
- UTC +08:00
- https://yangzhch6.github.io/
Pinned
- Mirror-Critique (Public): The official implementation of "Critique to Verify: Accurate and Honest Test-Time Scaling with RL-Trained Verifiers"
- AlignedCoT (Public): Implementation of our paper "Speak Like a Native: Prompting Large Language Models in a Native Style"