supervised-finetuning

Here are 30 public repositories matching this topic...

InternLM / xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

agent chatbot conversational-ai peft baichuan msagent large-language-models llm supervised-finetuning llava llm-training chatglm2 internlm llama2 qwen chatglm3 mixtral llama3 phi3

Updated Aug 21, 2024
Python

InternLM / InternLM-XComposer

Star

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

foundation gpt language-model multimodal multi-modality vision-transformer gpt-4 visual-language-learning llm chatgpt instruction-tuning large-language-model supervised-finetuning mllm vision-language-model large-vision-language-model

Updated Aug 30, 2024
Python

GaryYufei / AlignLLMHumanSurvey

Star

Aligning Large Language Models with Human: A Survey

awesome survey llama gpt-4 large-language-models llms chatgpt rlhf supervised-finetuning llama2 chinese-llama

Updated Sep 11, 2023

Tebmer / Awesome-Knowledge-Distillation-of-LLMs

Star

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & Vertical Distillation of LLMs.

compression feedback survey alignment self-training multi-modal knowledge-distillation data-augmentation kd data-synthesis self-distillation instruction-following llm large-language-model supervised-finetuning

Updated Sep 11, 2024

chaoswork / sft_datasets

Star

开源SFT数据集整理,随时补充

datasets chinese-dataset large-language-models llms supervised-finetuning

Updated Jun 2, 2023

magpie-align / magpie

Star

Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!

nlp paper dataset alignment gemma synthetic-data synthetic-dataset-generation llm supervised-finetuning llama2 qwen2 llama3 phi3

Updated Aug 28, 2024
Python

LIN-SHANG / InstructERC

Star

The offical realization of InstructERC

unified-data-processing emotion-recognition-in-conversation large-language-models supervised-finetuning chatglm-6b llama-7b chatglm2-6b llama2-7b

Updated Jul 16, 2024
Python

sail-sg / sdft

Star

[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".

language-model self-distillation supervised-finetuning

Updated Aug 22, 2024
Shell

ZhengxiangShi / InstructionModelling

Star

Code for the paper titled "Instruction Tuning With Loss Over Instructions"

natural-language-processing language-model instruction-tuning supervised-finetuning

Updated May 24, 2024
Python

fanqiwan / KCA

Star

Knowledge Verification to Nip Hallucination in the Bud

machine-learning hallucination large-language-models supervised-finetuning

Updated Mar 10, 2024
Python

quanshr / AugCon

Star

Automatically Generating Numerous Context-Driven SFT Data for LLMs across Diverse Granularity

synthetic-data large-language-model supervised-finetuning

Updated Aug 17, 2024
Python

bhattbhavesh91 / google-gemma-finetuning-n2sql

Sponsor

Star

Finetuning Google's Gemma Model for Translating Natural Language into SQL

google lora gemma natural-language-to-sql fine-tuning finetuning supervised-finetuning finetuning-llms

Updated Feb 22, 2024
Jupyter Notebook

BUAADreamer / MLLM-Finetuning-Demo

Star

使用LLaMA-Factory微调多模态大语言模型的示例代码 Demo of Finetuning Multimodal LLM with LLaMA-Factory

transformers lora pretraining huggingface-datasets supervised-finetuning mllm llava finetune-llm llama-factory paligemma yi-vl

Updated Sep 8, 2024
Python

sovit-123 / lm_sft

Star

Various LMs/LLMs below 3B parameters (for now) trained using SFT (Supervised Fine Tuning) for several downstream tasks

gpt bert gemma gpt2 large-language-models llms supervised-finetuning

Updated May 16, 2024
Jupyter Notebook

tien02 / llm-math

Star

Fine tune Large Language Model on Mathematic dataset

mathematics transformer llama lora huggingface llm supervised-finetuning llama2

Updated Dec 4, 2023
Python

KwokHing / AI-Planet-LLM-Bootcamp-Challenge

Star

An LLM challenge to (i) fine-tune pre-trained HuggingFace transformer model to build a Code Generation language model, and (ii) build a retrieval-augmented generation (RAG) application using LangChain