Pinned Loading
-
mlx-grpo-trainer
mlx-grpo-trainer Public🧠 Train your own DeepSeek-R1 style reasoning model on Mac! First MLX implementation of GRPO - the breakthrough technique behind R1's o1-matching performance. Build mathematical reasoning AI without…
Python 4
-
mlx-guided-grpo
mlx-guided-grpo PublicTrain reasoning models on your Mac. GRPO training framework for Apple Silicon with curriculum learning.
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.