mkurman

Mariusz Kurman mkurman

34 followers · 4 following

MedIT Solutions Kurman i Wspólnicy Sp. z o. o.
in/mariuszkurman
@mkurman88
https://huggingface.co/mkurman


synthetic-conversation-generator Public

A Python script to programmatically generate synthetic conversations for training LLMs, chatbots, and dialogue systems.

Python 3 1 Apache License 2.0 Updated Oct 22, 2025
jepa-llm Public

Fine-tuning causal language models with an additional JEPA-style representation regularisation loss as well as a plain Hugging Face trainer

Python 3 Apache License 2.0 Updated Oct 5, 2025
synthetic-questions-generation Public

Python 77 9 Apache License 2.0 Updated Aug 27, 2025
grpo-llm-evaluator Public

Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluations.

Python 47 4 Apache License 2.0 Updated May 7, 2025
augment-swebench-agent Public
Forked from augmentcode/augment-swebench-agent

The #1 open-source SWE-bench Verified implementation

Python Other Updated Apr 9, 2025
ii-researcher Public
Forked from Intelligent-Internet/ii-researcher

II-Researcher: a new open-source framework designed to aid building search / research agents

Python Apache License 2.0 Updated Mar 29, 2025
kron_torch Public
Forked from evanatyourservice/kron_torch

An implementation of PSGD Kron second-order optimizer for PyTorch

Python Creative Commons Attribution 4.0 International Updated Mar 24, 2025
nvamp-loss Public

Normalized Variance-Aware Max-Penalized Loss

Python 1 Apache License 2.0 Updated Mar 13, 2025
ReasonFlow Public

ReasonFlow is a novel framework designed to implement o1-like reasoning capabilities in large language models.

Python 17 5 MIT License Updated Feb 25, 2025
fixed-size-kv-cache Public

This project implements a fixed-size key-value cache for use in transformer models, specifically designed to work with the LLaMA model. The cache dynamically truncates the key and value states base…

Python 2 1 Apache License 2.0 Updated Feb 25, 2025
mcts-pytorch Public

A flexible Monte Carlo Tree Search framework with PyTorch for decision-making in language models.

Jupyter Notebook 10 1 MIT License Updated Feb 23, 2025
blurred-thoughts-SFT Public

Blurred-Thoughts Supervised-Finetuning (BT-SFT) is a new approach to fine-tuning language models, focusing on enhancing response diversity and creativity.

Python 6 1 MIT License Updated Feb 8, 2025
Large-Language-Model-Notebooks-Course Public
Forked from peremartra/Large-Language-Model-Notebooks-Course

Practical course about Large Language Models.

Jupyter Notebook 2 MIT License Updated Nov 11, 2024
self_reward_head_pytorch Public

This repository contains the implementation of a self-reward head designed for language models. The self-reward head enables the model to autonomously score its generated outputs, promoting self-as…

Python 2 Apache License 2.0 Updated Jun 21, 2024
linearmoe_pytorch Public

This repo contains my custom implementation of a mixture of experts as an extension of the linear layer.

Python 1 Apache License 2.0 Updated Jun 20, 2024
ai_agents_slide_deck Public archive

The repository houses the code and resources for running intelligent agents that automatically create comprehensive slide decks on specified topics.

Python Apache License 2.0 Updated Jun 3, 2024
hf_data_generation Public archive

This repo contains simple Jupyter notebook-based generation of machine-learning instruction datasets using open-source Hugging Face models and a Hugging Face Pro subscription.

Python Updated May 30, 2024
