PRIME-RL

Researching scalable reinforcement learning (RL) methods for language models.

Pinned

  1. SimpleVLA-RL (Public)

    SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

    Python · 804 stars · 32 forks

  2. Entropy-Mechanism-of-RL (Public)

    The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning

    Python · 342 stars · 9 forks

  3. RL-Compositionality (Public)

    From $f(x)$ and $g(x)$ to $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones

    Python · 22 stars · 3 forks

  4. TTRL (Public)

    [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning

    Python · 833 stars · 63 forks

  5. PRIME (Public)

    Scalable RL solution for advanced reasoning of language models

    Python · 1.7k stars · 100 forks

  6. ImplicitPRM (Public)

    Repository for the paper "Free Process Rewards without Process Labels"

    Python · 164 stars · 11 forks
