Efficient, Flexible, Multi-task Batch SFT Data Labeling Tool
Pretrain and fine-tune ANY AI model of ANY size on multiple GPUs or TPUs with zero code changes.
100000-Instruction-Following-Evaluation-SFT-for-Chinese-LLM-Text-Data
A React-based tool for constructing fine-tuning datasets with list and grid forms, with support for downloading and uploading data as JSONL files. The project uses the react-declarative library to build dynamic, interactive forms for defining user inputs, preferred outputs, and non-preferred outputs, along with associated tools.
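For context, a preference-style fine-tuning record in JSONL pairs a prompt with a chosen and a rejected completion, one JSON object per line. The sketch below uses hypothetical field names (input, preferred_output, non_preferred_output, tools); it is not the tool's actual schema.

```python
import json

# Hypothetical record layout for a preference-style fine-tuning dataset;
# the field names are assumptions, not the tool's real schema.
records = [
    {
        "input": "Summarize the benefits of supervised fine-tuning (SFT).",
        "preferred_output": "SFT adapts a pretrained model to follow instructions by ...",
        "non_preferred_output": "SFT is when you train a model.",
        "tools": [],
    }
]

# Write one JSON object per line (JSONL), the format used for download/upload.
with open("dataset.jsonl", "w", encoding="utf-8") as f:
    for rec in records:
        f.write(json.dumps(rec, ensure_ascii=False) + "\n")

# Read the file back the same way.
with open("dataset.jsonl", encoding="utf-8") as f:
    loaded = [json.loads(line) for line in f]
print(loaded[0]["preferred_output"])
```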
Supervised fine-tuning of the SmolLM model using the smalltalk dataset.
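A minimal sketch of supervised fine-tuning along these lines, using the Hugging Face TRL library; the model and dataset identifiers are assumptions, and the exact TRL arguments vary by version.

```python
# Sketch: SFT of a small causal LM with TRL. Model/dataset ids are assumptions.
from datasets import load_dataset
from trl import SFTTrainer, SFTConfig

# Assumed dataset id; swap in whichever chat/instruction dataset you actually use.
dataset = load_dataset("HuggingFaceTB/smoltalk", "all", split="train")

trainer = SFTTrainer(
    model="HuggingFaceTB/SmolLM2-135M",  # assumed model id; any small causal LM works
    train_dataset=dataset,
    args=SFTConfig(output_dir="smollm-sft", max_steps=100),
)
trainer.train()
```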
Gemma 3 4B LLM fine-tuned on a 100K doctor-patient QA dataset.
A modern Python-based MySQL backup tool with flexible archiving, multi-channel notifications (Telegram, Email, SMS, etc.), remote uploads (SFTP, FTP, SCP), and robust configuration validation.
🚀🚀 Train a small 26M-parameter GPT completely from scratch in just 2 hours! 🌏
Adapting an LLM to the medical domain through SFT, RAG, and multi-step fine-tuning to enhance domain knowledge and performance.
A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed at low training cost, covering base models, domain-specific fine-tunes and applications, datasets, and tutorials.
A powerful tool for creating fine-tuning datasets for LLMs.
A repo that serves as a baseline/guide for post-training (SFT/RLHF) of modern LLMs and evaluating them against baseline datasets.
Wrapped Megatron: As User-Friendly as HuggingFace, As Powerful as Megatron-LM
Training Qwen3 to solve Wordle using SFT and GRPO
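As a rough illustration of the reward side of such a setup, the sketch below scores a guess against a secret word with Wordle-style feedback and normalizes it to [0, 1]; it is a hypothetical reward, not the repository's actual implementation.

```python
# Hypothetical Wordle-style reward for an RL (e.g. GRPO) setup: +2 per letter in the
# correct position, +1 per letter present elsewhere, normalized to [0, 1].
from collections import Counter

def wordle_reward(guess: str, secret: str) -> float:
    guess, secret = guess.lower(), secret.lower()
    if len(guess) != len(secret):
        return 0.0
    score = 0
    remaining = Counter(secret)
    # First pass: exact matches (green letters).
    for g, s in zip(guess, secret):
        if g == s:
            score += 2
            remaining[g] -= 1
    # Second pass: right letter, wrong position (yellow letters).
    for g, s in zip(guess, secret):
        if g != s and remaining.get(g, 0) > 0:
            score += 1
            remaining[g] -= 1
    return score / (2 * len(secret))

print(wordle_reward("crane", "crate"))  # 0.8: four greens, one miss
```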