# transformers

Here are 3,041 public repositories matching this topic...

labmlai / annotated_deep_learning_paper_implementations

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

machine-learning reinforcement-learning deep-learning transformers pytorch transformer gan neural-networks literate-programming attention lora deep-learning-tutorial optimizers

Updated Sep 19, 2025
Python

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Updated Nov 10, 2025
Python

lucidrains / vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

computer-vision transformers artificial-intelligence image-classification attention-mechanism

Updated Nov 9, 2025
Python

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

python adapter transformers pytorch lora diffusion fine-tuning peft parameter-efficient-learning llm

Updated Nov 10, 2025
Python

arc53 / DocsGPT

Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.

react python search machine-learning natural-language-processing information-retrieval ai transformers pytorch language-model agents semantic-search hacktoberfest rag llm hacktoberfest2025 chatgpt agent-builder docsgpt

Updated Nov 7, 2025
Python

stas00 / ml-engineering

Machine Learning Engineering Open Book

training machine-learning ai scalability transformers slurm inference pytorch machine-learning-engineering mlops large-language-models llm

Updated Oct 27, 2025
Python

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

transformers model-para large-language-models

Updated Nov 10, 2025
Python

BlinkDL / RWKV-LM

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.

deep-learning transformers pytorch transformer lstm rnn gpt language-model attention-mechanism gpt-2 gpt-3 linear-attention rwkv chatgpt

Updated Nov 8, 2025
Python

PaddlePaddle / PaddleNLP

Easy-to-use and powerful LLM and SLM library with awesome model zoo.

nlp search-engine compression sentiment-analysis transformers information-extraction question-answering llama pretrained-models embedding bert semantic-analysis distributed-training ernie neural-search uie document-intelligence paddlenlp llm

Updated Nov 7, 2025
Python

neuml / txtai

💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows

Updated Nov 7, 2025
Python

qubvel-org / segmentation_models.pytorch

Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

Updated Oct 29, 2025
Python

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Updated Nov 7, 2025
Python

intel / ipex-llm

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc.

gpu transformers pytorch llm

Updated Oct 14, 2025
Python

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

reinforcement-learning raylib transformers proximal-policy-optimization large-language-models reinforcement-learning-from-human-feedback vllm openai-o1

Updated Nov 9, 2025
Python

EleutherAI / gpt-neo

An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

transformers gpt language-model gpt-2 gpt-3

Updated Feb 25, 2022
Python

lucidrains / PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

reinforcement-learning deep-learning transformers artificial-intelligence attention-mechanisms human-feedback

Updated Oct 11, 2025
Python

jessevig / bertviz

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

visualization nlp machine-learning natural-language-processing neural-network transformers pytorch transformer bert roberta gpt2

Updated Jun 1, 2025
Python

EleutherAI / gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

transformers language-model gpt-3 deepspeed-library

Updated Sep 26, 2025
Python

MaartenGr / BERTopic

Leveraging BERT and c-TF-IDF to create easily interpretable topics.

nlp machine-learning topic transformers topic-modeling bert topic-models sentence-embeddings topic-modelling ldavis

Updated Nov 5, 2025
Python

SkalskiP / courses

This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)

nlp machine-learning natural-language-processing tutorial deep-neural-networks computer-vision deep-learning transformers generative-model multimodal mlops stable-diffusion

Updated Apr 22, 2024
Python
