1k stars
Python, LLMs, synthetic data, agents
- Freelance
- Eindhoven / Trondheim
- UTC +01:00
- https://hf.co/ssmits
- @StijnSmits
- in/sjssmits
Stars
1 star written in TeX
A Survey of Reinforcement Learning for Large Reasoning Models