- MedIT Solutions Kurman i Wspólnicy Sp. z o. o.
- in/mariuszkurman
- @mkurman88
- https://huggingface.co/mkurman
Popular repositories
- grpo-llm-evaluator (Public): Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluations.
- ReasonFlow (Public): ReasonFlow is a novel framework designed to implement o1-like reasoning capabilities in large language models.
- mcts-pytorch (Public): A flexible Monte Carlo Tree Search framework with PyTorch for decision-making in language models.