View minyoungg's full-sized avatar

🧱 Modula software package

Python 330 29 Updated Aug 18, 2025
Python 699 65 Updated Apr 12, 2025

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,423 468 Updated May 20, 2026

Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama mode…

Jupyter Notebook 18,330 2,730 Updated May 19, 2026

PyTorch native post-training library

Python 5,756 722 Updated May 20, 2026

Minimalistic large language model 3D-parallelism training

Python 2,696 307 Updated Apr 7, 2026

A framework for few-shot evaluation of language models.

Python 12,635 3,280 Updated May 11, 2026
Python 71 7 Updated Jul 11, 2024

Lightweight, standalone C++ inference engine for Google's Gemma models.

C++ 6,903 638 Updated May 18, 2026

PyTorch native quantization and sparsity for training and inference

Python 2,828 507 Updated May 20, 2026

Mamba SSM architecture

Python 18,275 1,734 Updated May 10, 2026

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Python 6,210 572 Updated Aug 22, 2025

Python toolbox for optimization on Riemannian manifolds with support for automatic differentiation

Python 896 166 Updated Jun 2, 2025

Train transformer language models with reinforcement learning.

Python 18,422 2,729 Updated May 20, 2026

Fast and memory-efficient exact attention

Python 23,859 2,744 Updated May 20, 2026

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

Python 3,349 723 Updated May 20, 2026

PyTorch extensions for high performance and large scale training.

Python 3,406 297 Updated Apr 26, 2025

Ongoing research training transformer models at scale

Python 16,406 3,980 Updated May 20, 2026

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4-bit quantization, LoRA and LLaMA-Adapter fine-tuning, and pre-training. Apache 2.0-licensed.

Python 6,078 521 Updated Jul 1, 2025

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 32,219 6,681 Updated Sep 30, 2025

A fast, clean, responsive Hugo theme.

HTML 13,532 3,392 Updated May 10, 2026

Inference Llama 2 in one file of pure C

C 19,532 2,551 Updated Aug 6, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 58,498 10,051 Updated Nov 12, 2025

Foundation Architecture for (M)LLMs

Python 3,130 225 Updated Apr 11, 2024

Hugging Face-compatible implementation of RetNet (Retentive Networks, https://arxiv.org/pdf/2307.08621.pdf), including parallel, recurrent, and chunkwise forward.

Jupyter Notebook 226 26 Updated Mar 12, 2024

Generative Models by Stability AI

Python 27,160 3,085 Updated Dec 16, 2025

Accessible large language models via k-bit quantization for PyTorch.

Python 8,214 854 Updated May 20, 2026

Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".

Python 113 12 Updated Jun 8, 2023

Open source code for paper "On the Learning and Learnability of Quasimetrics".

C++ 32 1 Updated Nov 28, 2022