-
MedIT Solutions Kurman i Wspólnicy Sp. z o. o.
- in/mariuszkurman
- @mkurman88
- https://huggingface.co/mkurman
-
A Python script to programmatically generate synthetic conversations for training LLMs, chatbots, and dialogue systems.
-
jepa-llm Public
Fine-tuning causal language models with an additional JEPA-style representation regularisation loss as well as a plain Hugging Face trainer
-
-
grpo-llm-evaluator Public
Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluations.
-
augment-swebench-agent Public
Forked from augmentcode/augment-swebench-agentThe #1 open-source SWE-bench Verified implementation
Python Other UpdatedApr 9, 2025 -
ii-researcher Public
Forked from Intelligent-Internet/ii-researcherII-Researcher: a new open-source framework designed to aid building search / research agents
Python Apache License 2.0 UpdatedMar 29, 2025 -
kron_torch Public
Forked from evanatyourservice/kron_torchAn implementation of PSGD Kron second-order optimizer for PyTorch
Python Creative Commons Attribution 4.0 International UpdatedMar 24, 2025 -
nvamp-loss Public
Normalized Variance-Aware Max-Penalized Loss
-
ReasonFlow Public
ReasonFlow is a novel framework designed to implement o1-like reasoning capabilities in large language models.
-
fixed-size-kv-cache Public
This project implements a fixed-size key-value cache for use in transformer models, specifically designed to work with the LLaMA model. The cache dynamically truncates the key and value states base…
-
mcts-pytorch Public
A flexible Monte Carlo Tree Search framework with PyTorch for decision-making in language models.
-
blurred-thoughts-SFT Public
Blurred-Thoughts Supervised-Finetuning (BT-SFT) is a new approach to fine-tuning language models, focusing on enhancing response diversity and creativity.
-
Large-Language-Model-Notebooks-Course Public
Forked from peremartra/Large-Language-Model-Notebooks-CoursePractical course about Large Language Models.
-
self_reward_head_pytorch Public
This repository contains the implementation of a self-reward head designed for language models. The self-reward head enables the model to autonomously score its generated outputs, promoting self-as…
-
linearmoe_pytorch Public
This repo contains my custom implementation of a mixture of experts as an extension of the linear layer.
-
ai_agents_slide_deck Public archive
The repository houses the code and resources for running intelligent agents that automatically create comprehensive slide decks on specified topics..
Python Apache License 2.0 UpdatedJun 3, 2024 -
hf_data_generation Public archive
This repo contains simple jupyter notebook based machine-learning instruction datasets generation using open-sourced huggingface models and hugginface pro subscription.
Python UpdatedMay 30, 2024