Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
🧑🏫 60+ implementations/tutorials of deep learning papers with side-by-side notes 📝, including transformers (original, XL, Switch, Feedback, ViT, ...), optimizers (Adam, AdaBelief, Sophia, ...), GANs (CycleGAN, StyleGAN2, ...), 🎮 reinforcement learning (PPO, DQN), CapsNet, distillation, ... 🧠
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
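A minimal usage sketch, assuming this refers to the lucidrains vit-pytorch package; the image size, patch size, and model dimensions below are illustrative choices, not tuned values:

```python
# Build a small ViT classifier and run a forward pass on a dummy image.
import torch
from vit_pytorch import ViT

model = ViT(
    image_size=256,   # input resolution (must be divisible by patch_size)
    patch_size=32,    # the image becomes an 8x8 = 64-token patch sequence
    num_classes=1000,
    dim=1024,         # transformer embedding dimension
    depth=6,          # number of encoder blocks
    heads=16,
    mlp_dim=2048,
)

img = torch.randn(1, 3, 256, 256)
logits = model(img)   # shape: (1, 1000)
```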
Ongoing research training transformer models at scale
Easy-to-use and powerful LLM and SLM library with an awesome model zoo.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
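A minimal LoRA sketch with PEFT; the base model (gpt2), the target module name, and the LoRA hyperparameters are illustrative assumptions, not recommendations:

```python
# Wrap a pretrained model with LoRA adapters so only a small set of parameters is trained.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

base = AutoModelForCausalLM.from_pretrained("gpt2")  # assumed small demo model

config = LoraConfig(
    r=8,                        # rank of the low-rank update matrices
    lora_alpha=16,              # scaling factor for the update
    target_modules=["c_attn"],  # GPT-2's fused attention projection (model-specific)
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)

model = get_peft_model(base, config)
model.print_trainable_parameters()  # adapters are a tiny fraction of the base model's weights
```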
Private AI platform for agents, assistants, and enterprise search. Built-in agent builder, deep research, document analysis, multi-model support, and API connectivity for agents.
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
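A minimal sketch assuming this is segmentation_models.pytorch; the encoder name, pretrained weights, and class count are illustrative:

```python
# Build a U-Net with a pretrained encoder and run a forward pass on a dummy batch.
import torch
import segmentation_models_pytorch as smp

model = smp.Unet(
    encoder_name="resnet34",     # any supported convolutional or transformer backbone
    encoder_weights="imagenet",  # pretrained encoder weights
    in_channels=3,
    classes=2,                   # number of segmentation classes
)

masks = model(torch.randn(1, 3, 256, 256))  # logits of shape (1, 2, 256, 256)
```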
A PyTorch-based Speech Toolkit
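A minimal speech-recognition sketch, assuming the toolkit is SpeechBrain; the pretrained model id and the audio path are illustrative, and recent releases expose the same classes under speechbrain.inference:

```python
# Transcribe a local audio file with a pretrained encoder-decoder ASR model.
from speechbrain.pretrained import EncoderDecoderASR  # speechbrain.inference in newer releases

asr = EncoderDecoderASR.from_hparams(
    source="speechbrain/asr-crdnn-rnnlm-librispeech",  # assumed pretrained LibriSpeech model
    savedir="pretrained_asr",
)
print(asr.transcribe_file("example.wav"))  # hypothetical local audio file
```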
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc.
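A rough sketch of the drop-in low-bit loading pattern this project documents, assuming the ipex_llm.transformers wrapper; the model id, quantization flag, and device string are assumptions and may differ across versions:

```python
# Load a Hugging Face causal LM with 4-bit weights and move it to an Intel XPU device.
from ipex_llm.transformers import AutoModelForCausalLM  # assumed drop-in replacement API
from transformers import AutoTokenizer

model_path = "meta-llama/Llama-2-7b-chat-hf"  # illustrative model id
tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(model_path, load_in_4bit=True)
model = model.to("xpu")  # e.g. an Intel Arc discrete GPU

inputs = tokenizer("What is a transformer?", return_tensors="pt").to("xpu")
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=32)[0]))
```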
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
RWKV (pronounced RwaKuv) is an RNN with great LLM performance that can also be trained directly like a GPT transformer (parallelizable). The current version is RWKV-7 "Goose". It combines the best of RNNs and transformers: great performance, linear time, constant space (no KV cache), fast training, infinite context length, and free sentence embeddings.
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
Machine Learning Engineering Open Book
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
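A minimal sketch assuming the library is BERTopic, using the 20 Newsgroups corpus purely as illustrative data:

```python
# Fit a topic model: embed documents, cluster them, and label clusters with c-TF-IDF keywords.
from sklearn.datasets import fetch_20newsgroups
from bertopic import BERTopic

docs = fetch_20newsgroups(subset="all", remove=("headers", "footers", "quotes"))["data"]

topic_model = BERTopic()
topics, probs = topic_model.fit_transform(docs)
print(topic_model.get_topic_info().head())  # one row per discovered topic with its top keywords
```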
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
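A minimal head_view sketch (intended to run in a Jupyter notebook); the BERT checkpoint and the example sentence are illustrative:

```python
# Visualize per-head attention weights for a single sentence.
from transformers import AutoTokenizer, AutoModel
from bertviz import head_view

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased", output_attentions=True)

inputs = tokenizer("The cat sat on the mat", return_tensors="pt")
outputs = model(**inputs)
tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])

head_view(outputs.attentions, tokens)  # interactive attention-head visualization
```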
An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
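A minimal detect-and-redact sketch, assuming the framework is Microsoft Presidio; the sample text is made up:

```python
# Detect PII entities in free text, then replace them with entity-type placeholders.
from presidio_analyzer import AnalyzerEngine
from presidio_anonymizer import AnonymizerEngine

text = "My name is Jane Doe and my phone number is 212-555-0123."

results = AnalyzerEngine().analyze(text=text, language="en")
redacted = AnonymizerEngine().anonymize(text=text, analyzer_results=results)
print(redacted.text)  # e.g. "My name is <PERSON> and my phone number is <PHONE_NUMBER>."
```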
An easy-to-use, scalable, and high-performance RLHF framework built on Ray (PPO, GRPO, REINFORCE++, vLLM, dynamic sampling, async agentic RL)
💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows
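A minimal semantic-search sketch, assuming the framework is txtai; the embedding model and the two sample documents are illustrative:

```python
# Index a few documents and run a semantic (embedding-based) search over them.
from txtai.embeddings import Embeddings

data = [
    "Transformers power modern search engines.",
    "The weather is sunny and warm today.",
]

embeddings = Embeddings({"path": "sentence-transformers/all-MiniLM-L6-v2"})
embeddings.index([(uid, text, None) for uid, text in enumerate(data)])

print(embeddings.search("neural information retrieval", 1))  # [(doc id, similarity score)]
```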
Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
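A minimal text-classification sketch, assuming the library is Simple Transformers; the two-row training set exists only to show the API shape:

```python
# Fine-tune a BERT classifier on a toy dataset and run a prediction.
import pandas as pd
from simpletransformers.classification import ClassificationModel

train_df = pd.DataFrame(
    [["great movie, loved it", 1], ["terrible and boring", 0]],
    columns=["text", "labels"],
)

model = ClassificationModel("bert", "bert-base-uncased", num_labels=2, use_cuda=False)
model.train_model(train_df)

predictions, raw_outputs = model.predict(["an absolute delight"])
```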