-
MedIT Solutions Kurman i Wspólnicy Sp. z o. o.
- in/mariuszkurman
- @mkurman88
- https://huggingface.co/mkurman
Popular repositories Loading
-
-
grpo-llm-evaluator
grpo-llm-evaluator PublicFine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluations.
-
ReasonFlow
ReasonFlow PublicReasonFlow is a novel framework designed to implement o1-like reasoning capabilities in large language models.
-
mcts-pytorch
mcts-pytorch PublicA flexible Monte Carlo Tree Search framework with PyTorch for decision-making in language models.
-
blurred-thoughts-SFT
blurred-thoughts-SFT PublicBlurred-Thoughts Supervised-Finetuning (BT-SFT) is a new approach to fine-tuning language models, focusing on enhancing response diversity and creativity.
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.