Skip to content
@PRIME-RL

PRIME-RL

Researching scalable (RL) methods on language models.

Pinned Loading

  1. P1 P1 Public

    P1: Mastering Physics Olympiads with Reinforcement Learning

    74 4

  2. SimpleVLA-RL SimpleVLA-RL Public

    [ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

    Python 1.4k 83

  3. Entropy-Mechanism-of-RL Entropy-Mechanism-of-RL Public

    The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

    Python 420 15

  4. RL-Compositionality RL-Compositionality Public

    FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones

    Python 60 5

  5. TTRL TTRL Public

    [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning

    Python 989 72

  6. PRIME PRIME Public

    Scalable RL solution for advanced reasoning of language models

    Python 1.8k 103

Repositories

Showing 8 of 8 repositories
  • P1-VL Public

    P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

    PRIME-RL/P1-VL’s past year of commit activity
    13 1 0 0 Updated Feb 11, 2026
  • RL-Compositionality Public

    FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones

    PRIME-RL/RL-Compositionality’s past year of commit activity
    Python 60 Apache-2.0 5 2 0 Updated Jan 26, 2026
  • SimpleVLA-RL Public

    [ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

    PRIME-RL/SimpleVLA-RL’s past year of commit activity
    Python 1,379 MIT 83 45 1 Updated Jan 6, 2026
  • P1 Public

    P1: Mastering Physics Olympiads with Reinforcement Learning

    PRIME-RL/P1’s past year of commit activity
    74 4 3 0 Updated Dec 29, 2025
  • TTRL Public

    [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning

    PRIME-RL/TTRL’s past year of commit activity
    Python 989 MIT 72 16 0 Updated Sep 26, 2025
  • Entropy-Mechanism-of-RL Public

    The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

    PRIME-RL/Entropy-Mechanism-of-RL’s past year of commit activity
    Python 420 15 2 0 Updated Jul 11, 2025
  • PRIME Public

    Scalable RL solution for advanced reasoning of language models

    PRIME-RL/PRIME’s past year of commit activity
    Python 1,805 Apache-2.0 103 8 1 Updated Mar 18, 2025
  • ImplicitPRM Public

    Repo of paper "Free Process Rewards without Process Labels"

    PRIME-RL/ImplicitPRM’s past year of commit activity
    Python 168 Apache-2.0 11 12 0 Updated Mar 14, 2025

Top languages

Loading…

Most used topics