Fast & efficient BPE tokenizer written in C & Python for LLM training
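For context, the core of BPE training is a loop that counts adjacent symbol pairs and merges the most frequent pair into a new symbol. The sketch below shows one such merge step on a toy corpus; the symbol IDs and merge scheme are illustrative and not taken from any repo listed here.

```cpp
// One BPE training step: count adjacent pairs, merge the most frequent.
// A minimal sketch on toy data, not the API of any project on this page.
#include <algorithm>
#include <cstdio>
#include <map>
#include <utility>
#include <vector>

int main() {
    // Toy "corpus": symbol IDs after an initial byte-level split.
    std::vector<int> syms = {1, 2, 1, 2, 3, 1, 2};

    // Count adjacent symbol pairs.
    std::map<std::pair<int, int>, int> pairs;
    for (size_t i = 0; i + 1 < syms.size(); ++i)
        ++pairs[{syms[i], syms[i + 1]}];

    // Pick the most frequent pair to merge into a new symbol.
    auto best = std::max_element(
        pairs.begin(), pairs.end(),
        [](auto& a, auto& b) { return a.second < b.second; });

    int new_id = 4;  // next unused symbol ID
    std::vector<int> merged;
    for (size_t i = 0; i < syms.size(); ++i) {
        if (i + 1 < syms.size() &&
            std::make_pair(syms[i], syms[i + 1]) == best->first) {
            merged.push_back(new_id);
            ++i;  // skip the second symbol of the merged pair
        } else {
            merged.push_back(syms[i]);
        }
    }
    for (int s : merged) std::printf("%d ", s);  // prints: 4 4 3 4
    std::printf("\n");
}
```

A full tokenizer repeats this step until the vocabulary reaches its target size, recording each merge so encoding can replay them in order.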
Carbon is a pure C++ Transformer framework inspired by GPT, featuring SIMD-optimized tensor math, multi-head attention, feedforward layers, and BPE tokenization. It’s a fully self-contained system for training and running language models without external modules or libraries.
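Since the Carbon description centers on multi-head attention, here is a minimal single-head scaled dot-product attention, softmax(Q Kᵀ / √d) V, in plain C++. This is a sketch of the operation such a framework implements, not Carbon's actual code, and it omits the SIMD optimization and multi-head splitting the project describes.

```cpp
// Single-head scaled dot-product attention: softmax(Q K^T / sqrt(d)) V.
// Illustrative sketch only; shapes are (n tokens) x (d dims).
#include <algorithm>
#include <cmath>
#include <cstdio>
#include <vector>

using Mat = std::vector<std::vector<float>>;

Mat attention(const Mat& Q, const Mat& K, const Mat& V) {
    size_t n = Q.size(), d = Q[0].size();
    float inv_sqrt_d = 1.0f / std::sqrt(static_cast<float>(d));
    Mat out(n, std::vector<float>(V[0].size(), 0.0f));
    for (size_t i = 0; i < n; ++i) {
        // Scaled scores for query i against every key.
        std::vector<float> s(n);
        float mx = -1e30f;
        for (size_t j = 0; j < n; ++j) {
            s[j] = 0.0f;
            for (size_t k = 0; k < d; ++k) s[j] += Q[i][k] * K[j][k];
            s[j] *= inv_sqrt_d;
            mx = std::max(mx, s[j]);
        }
        // Numerically stable softmax over the scores.
        float sum = 0.0f;
        for (float& v : s) { v = std::exp(v - mx); sum += v; }
        for (float& v : s) v /= sum;
        // Output row i is the attention-weighted sum of value rows.
        for (size_t j = 0; j < n; ++j)
            for (size_t k = 0; k < V[0].size(); ++k)
                out[i][k] += s[j] * V[j][k];
    }
    return out;
}

int main() {
    Mat Q = {{1, 0}, {0, 1}};
    Mat K = {{1, 0}, {0, 1}};
    Mat V = {{1, 2}, {3, 4}};
    Mat O = attention(Q, K, V);
    std::printf("%.3f %.3f\n%.3f %.3f\n", O[0][0], O[0][1], O[1][0], O[1][1]);
}
```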
A C++ framework for efficient training & fine-tuning of LLMs
MobileFineTuner: Native C++ framework for fine-tuning LLMs directly on mobile devices. Features LoRA and full fine-tuning, ZeRO-inspired parameter sharding, energy-aware throttling, and a custom autograd engine. Keeps your data on-device.
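The LoRA technique MobileFineTuner lists works by freezing the pretrained weight W and training only a low-rank update, y = W x + (α/r) · B A x with A ∈ ℝ^(r×d) and B ∈ ℝ^(d_out×r). The sketch below shows that forward pass under assumed toy shapes; the function and variable names are hypothetical, not MobileFineTuner's API.

```cpp
// Hedged sketch of a LoRA forward pass: y = W x + (alpha/r) * B (A x).
// W is frozen; only the low-rank factors A (r x d) and B (d_out x r) train.
// Names and shapes are illustrative, not taken from MobileFineTuner.
#include <cstdio>
#include <vector>

using Mat = std::vector<std::vector<float>>;
using Vec = std::vector<float>;

Vec matvec(const Mat& m, const Vec& x) {
    Vec y(m.size(), 0.0f);
    for (size_t i = 0; i < m.size(); ++i)
        for (size_t j = 0; j < x.size(); ++j)
            y[i] += m[i][j] * x[j];
    return y;
}

Vec lora_forward(const Mat& W, const Mat& A, const Mat& B,
                 const Vec& x, float alpha) {
    float r = static_cast<float>(A.size());   // LoRA rank
    Vec base = matvec(W, x);                  // frozen pretrained path
    Vec low  = matvec(B, matvec(A, x));       // trainable low-rank path
    for (size_t i = 0; i < base.size(); ++i)
        base[i] += (alpha / r) * low[i];
    return base;
}

int main() {
    Mat W = {{1, 0}, {0, 1}};        // frozen 2x2 weight
    Mat A = {{0.1f, 0.2f}};          // rank r=1, input dim d=2
    Mat B = {{0.5f}, {0.5f}};        // output dim 2, rank 1
    Vec y = lora_forward(W, A, B, {1.0f, 2.0f}, /*alpha=*/1.0f);
    std::printf("%f %f\n", y[0], y[1]);  // 1.25 2.25
}
```

Because only A and B receive gradients, the optimizer state shrinks from O(d·d_out) to O(r·(d + d_out)), which is what makes on-device fine-tuning plausible.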
Real-time, low-overhead visualization of LLM internals during training.
Quantized LLM training in pure CUDA/C++.
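A common building block behind quantized training is symmetric int8 fake-quantization: weights are rounded onto an int8 grid, then dequantized for the forward pass so the model trains against quantization error. The sketch below illustrates that rounding step in plain C++; it is a generic example, not code from the repo above.

```cpp
// Symmetric int8 fake-quantization of a small weight vector.
// Generic illustration of the rounding step in quantized training.
#include <algorithm>
#include <cmath>
#include <cstdint>
#include <cstdio>
#include <vector>

int main() {
    std::vector<float> w = {-0.8f, 0.1f, 0.5f, -0.2f};

    // Scale maps the max-magnitude weight onto the int8 range [-127, 127].
    float max_abs = 0.0f;
    for (float v : w) max_abs = std::max(max_abs, std::fabs(v));
    float scale = max_abs / 127.0f;

    // Quantize to int8, then dequantize ("fake quant") for the forward pass.
    for (float v : w) {
        int8_t q = static_cast<int8_t>(std::lround(v / scale));
        float deq = q * scale;
        std::printf("w=%+.3f  q=%+4d  deq=%+.4f\n", v, q, deq);
    }
}
```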