Floating Point Sigma Lab
Popular repositories
- trl-slime Public Forked from huggingface/trl
Train transformer language models with reinforcement learning.
Python 4
Repositories
Showing 1 of 1 repositories
- trl-slime Public Forked from huggingface/trl
Train transformer language models with reinforcement learning.