deepspeed

Star

Here are 120 public repositories matching this topic...

InternLM / lmdeploy

Star

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

llama cuda-kernels deepspeed llm fastertransformer llm-inference turbomind internlm llama2 codellama llama3

Updated May 14, 2026
Python

PKU-Alignment / safe-rlhf

Star

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Updated Nov 24, 2025
Python

zjunlp / KnowLM

Star

An Open-sourced Knowledgable Large Language Model Framework.

Updated Jan 11, 2025
Python

Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Train your own 8B/14B LLaVA-training-like MLLM on RTX3090/4090 24GB.

fine-tuning pipeline-parallelism pretraining model-parallel deepspeed mllm multimodal-large-language-models qwen video-large-language-models video-language-model

Updated Mar 10, 2025
Jupyter Notebook

alibaba / Megatron-LLaMA

Star

Best practice for training LLaMA models in Megatron-LM

pytorch llama distributed-training pretraining deepspeed megatron-lm llm

Updated Jan 2, 2024
Python

LambdaLabsML / distributed-training-guide

Star

Best practices & guides on how to write distributed pytorch training code

gpu cluster mpi cuda slurm pytorch sharding kuberentes distributed-training nccl gpu-cluster deepspeed fsdp lambdalabs

Updated Oct 22, 2025
Python

antgroup / glake

Star

GLake: optimizing GPU memory management and IO transmission.

memory gpu pytorch onnx deepspeed llm

Updated Mar 24, 2025
Python

shm007g / LLaMA-Cult-and-More

Star

Large Language Models for All, 🦙 Cult and More, Stay in touch !

tensorflow transformers pytorch llama gpt alpaca loralib vicuna deepspeed gpt4 llm chatgpt ggml gptq

Updated Jun 1, 2023
HTML

Xirider / finetune-gpt2xl

Star

Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpeed

finetuning gpt2 huggingface huggingface-transformers gpt3 deepspeed gpt-neo gpt-neo-fine-tuning

Updated Jun 14, 2023
Python

OpenMOSS / CoLLiE

Star

Collaborative Training of Large Language Models in an Efficient Way

nlp deep-learning pytorch deepspeed

Updated Aug 28, 2024
Python

openpsi-project / ReaLHF

Star

Super-Efficient RLHF Training of LLMs with Parameter Reallocation

distributed-systems reinforcement-learning distributed-computing transformers large-scale-machine-learning deepspeed megatron-lm large-language-models llm reinforcement-learning-from-human-feedback llm-training llm-framework

Updated Apr 24, 2025
Python

sunzeyeah / RLHF

Star

Implementation of Chinese ChatGPT

nlp deep-learning pytorch glm pangu deepspeed chatgpt

Updated Nov 20, 2023
Python

stanleylsx / llms_tool

Star

一个基于HuggingFace开发的大语言模型训练、测试工具。支持各模型的webui、终端预测，低参数量及全参数模型训练(预训练、SFT、RM、PPO、DPO)和融合、量化。

bloom pytorch falcon llama moss mistral aquila baichuan deepspeed chatglm chatglm2 internlm llama2 qwen xverse baichuan2 aquila2 chatglm3

Updated Dec 8, 2023
Python

bobo0810 / LearnDeepSpeed

Star

DeepSpeed教程 & 示例注释 & 学习笔记（大模型高效训练）

examples deepspeed large-language-models

Updated Sep 7, 2023
Python

git-cloner / llama2-lora-fine-tuning

Star

llama2 finetuning with deepspeed and lora

lora finetuning deepspeed llama2

Updated Jul 28, 2023
Python

jackaduma / ChatGLM-LoRA-RLHF-PyTorch

Star

A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatGPT but with ChatGLM

pytorch llama gpt lora finetune ppo peft deepspeed llm chatgpt rlhf reward-models chatglm chatglm-6b

Updated Apr 28, 2023
Python

HomebrewML / revlib

Star

Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload

deep-learning pytorch tpu revnet xla deepspeed momentumnet

Updated Aug 6, 2022
Python

CoinCheung / gdGPT

Star

Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline mode. Faster than zero/zero++/fsdp.

nlp bloom pipeline pytorch deepspeed llm full-finetune model-parallization flash-attention llama2 baichuan2-7b chatglm3-6b mixtral-8x7b

Updated Feb 5, 2024
Python

OpenCSGs / llm-inference

Star

llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deployment, such as UI, RESTful API, auto-scaling, computing resource management, monitoring, and more.

transformer ray deepspeed llama-cpp vllm llm-inference

Updated May 17, 2024
Python

xyjigsaw / LLM-Pretrain-SFT

Star

Scripts of LLM pre-training and fine-tuning (w/wo LoRA, DeepSpeed)

llama lora mistral deepspeed large-language-models baichuan2

Updated Jan 30, 2024
Python

Improve this page

Add a description, image, and links to the deepspeed topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the deepspeed topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

deepspeed

Here are 120 public repositories matching this topic...

InternLM / lmdeploy

PKU-Alignment / safe-rlhf

zjunlp / KnowLM

Coobiw / MPP-LLaVA

alibaba / Megatron-LLaMA

LambdaLabsML / distributed-training-guide

antgroup / glake

shm007g / LLaMA-Cult-and-More

Xirider / finetune-gpt2xl

OpenMOSS / CoLLiE

openpsi-project / ReaLHF

sunzeyeah / RLHF

stanleylsx / llms_tool

bobo0810 / LearnDeepSpeed

git-cloner / llama2-lora-fine-tuning

jackaduma / ChatGLM-LoRA-RLHF-PyTorch

HomebrewML / revlib

CoinCheung / gdGPT

OpenCSGs / llm-inference

xyjigsaw / LLM-Pretrain-SFT

Improve this page

Add this topic to your repo