yangzhch6

Follow

Deep Thinking

yangzhch6 yangzhch6

Deep Thinking

Follow

PhD Student | Reasoning with LLMs

34 followers · 63 following

Shenzhen, China
23:37 (UTC +08:00)
https://yangzhch6.github.io/

Achievements

Achievements

Highlights

Pro

Pinned Loading

DARS DARS Public

The official implemention of "Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration"

Python 19
Mirror-Critique Mirror-Critique Public

The official implemention of "Critique to Verify: Accurate and Honest Test-Time Scaling with RL-Trained Verifiers"

Python 7 1
ReSocratic ReSocratic Public

OptiBench and ReSocratic Synthesis Method

Python 29 1
AlignedCoT AlignedCoT Public

Implementation of our paper "Speak Like a Native: Prompting Large Language Models in a Native Style"

8 1