Skip to content
View Valdanitooooo's full-sized avatar
😅
Focusing
😅
Focusing

Organizations

@detectiveboys

Block or report Valdanitooooo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

model-training

17 repositories

A quick guide (especially) for trending instruction finetuning datasets

3,328 226 Updated Nov 28, 2023

Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.

HTML 1,017 100 Updated Apr 27, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,043 4,669 Updated Dec 19, 2025

Example models using DeepSpeed

Python 6,749 1,112 Updated Dec 19, 2025

Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"

Python 1,219 120 Updated Mar 10, 2024

An Open-Source Framework for Prompt-Learning.

Python 4,793 485 Updated Jul 16, 2024

Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)

Python 2,735 136 Updated Mar 13, 2024

methods2test is a supervised dataset consisting of Test Cases and their corresponding Focal Methods from a set of Java software repositories

Python 172 46 Updated Dec 4, 2023

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/

Python 4,756 507 Updated Dec 16, 2025

LLM training code for Databricks foundation models

Python 4,371 578 Updated Oct 27, 2025

chatglm多gpu用deepspeed和

Python 411 60 Updated Jul 8, 2024

Full description can be found here: https://discuss.huggingface.co/t/pretrain-gpt-neo-for-open-source-github-copilot-model/7678?u=ncoop57

Python 3,287 220 Updated Jan 18, 2022

Plain pytorch implementation of LLaMA

Python 188 28 Updated May 22, 2023

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 11,754 1,070 Updated Dec 20, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 64,249 7,787 Updated Dec 19, 2025

A dataset template for guiding chat-models to self-cognition, including information about the model’s identity, capabilities, usage, limitations, etc.

Python 30 2 Updated Sep 4, 2023

Praetor is a lightweight finetuning data and prompt management tool

Python 67 Updated Nov 16, 2024