mkurman

Mariusz Kurman mkurman

Achievements

synthetic-questions-generation synthetic-questions-generation Public

Python 82 10
grpo-llm-evaluator grpo-llm-evaluator Public

Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluations.

Python 51 5
neuroblast-v3 neuroblast-v3 Public

NeuroBLAST v3 architecture code

Python 36 2
synthlabs synthlabs Public

Create synthetic datasets from scratch using AI-powered generation. Define topics, customize prompts, and generate high-quality reasoning traces in the SYNTH format.

TypeScript 29 8
ReasonFlow ReasonFlow Public

ReasonFlow is a novel framework designed to implement o1-like reasoning capabilities in large language models.

Python 19 5
mcts-pytorch mcts-pytorch Public

A flexible Monte Carlo Tree Search framework with PyTorch for decision-making in language models.

Jupyter Notebook 11 2