In this project, I trained a 10M-parameter GPT-like model based on this great video from Andrej Karpathy: "Let's build GPT: from scratch, in code, spelled out." I used the tiny Shakespeare dataset. The final model produces text with the same structure as a typical Shakespeare text (though the English is far from grammatically and syntactically correct). This project focuses only on the pretraining stage of LLM training.
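Following the video, training works at the character level. The snippet below is an illustration of that idea rather than code from this repo: it assumes the tiny Shakespeare text has been saved locally as `input.txt` (the file name is hypothetical) and shows how the corpus can be turned into integer tokens.

```python
# Minimal character-level tokenization sketch (illustrative; assumes the
# tiny Shakespeare text has been downloaded to input.txt).
with open("input.txt", "r", encoding="utf-8") as f:
    text = f.read()

# Build the vocabulary from the unique characters in the corpus.
chars = sorted(set(text))
stoi = {ch: i for i, ch in enumerate(chars)}
itos = {i: ch for ch, i in stoi.items()}

encode = lambda s: [stoi[c] for c in s]             # string -> token ids
decode = lambda ids: "".join(itos[i] for i in ids)  # token ids -> string

print(f"vocabulary size: {len(chars)}")
print(encode("ROMEO:"))
```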
Start by creating a virtual environment and installing the dependencies:
```bash
uv venv --python 3.12
source .venv/bin/activate
uv pip install -r requirements.txt
```

Generate text using the last trained model:
```bash
python generate_text.py --checkpoint-path ./checkpoints/run_20241228_212938/model.pt --max-tokens 1000
```

- `--checkpoint-path`: Path to the pre-trained model file
- `--max-tokens`: Maximum number of tokens to generate
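Conceptually, generating from a checkpoint boils down to restoring the model and sampling tokens autoregressively. The loop below is a minimal sketch of that idea, not the actual code in `generate_text.py` or `load.py`; the model interface (returning logits of shape `(batch, time, vocab_size)`) and all names are assumptions.

```python
import torch

@torch.no_grad()
def sample(model, idx, max_new_tokens, block_size):
    """Autoregressively extend the token sequence idx by max_new_tokens tokens.

    Assumes model(idx) returns logits of shape (batch, time, vocab_size);
    names and shapes are illustrative of a typical GPT sampling loop.
    """
    model.eval()
    for _ in range(max_new_tokens):
        # Crop the context to the model's maximum block size.
        idx_cond = idx[:, -block_size:]
        logits = model(idx_cond)
        # Keep only the logits for the last position and sample the next token.
        probs = torch.softmax(logits[:, -1, :], dim=-1)
        next_token = torch.multinomial(probs, num_samples=1)
        idx = torch.cat([idx, next_token], dim=1)
    return idx
```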
- `config.py`: hyperparameters and configuration
- `generate_text.py`: command-line interface for generating text using the pre-trained model
- `get_metadata.py`: get some metadata about the model
- `load.py`: load the model from a given .pt file
- `model.py`: definition of the different components of the GPT model (a rough sketch of one such component follows after this list)
- `preprocessing.py`: loading and preprocessing of the data
- `training.py`: training loop and model saving
- `prototype.py`: first draft of the code, all the steps detailed above in a single file
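To give a rough idea of what `model.py` contains, a GPT in this style is built from stacked decoder blocks combining causal multi-head self-attention with an MLP and residual connections. The class below is a generic sketch of such a block, not the repo's actual implementation; class names, hyperparameters, and the use of `nn.MultiheadAttention` are assumptions.

```python
import torch
import torch.nn as nn

class Block(nn.Module):
    """One pre-norm transformer decoder block: causal self-attention followed
    by an MLP, each wrapped in a residual connection. Names are illustrative."""

    def __init__(self, n_embd: int, n_head: int, dropout: float = 0.1):
        super().__init__()
        self.ln1 = nn.LayerNorm(n_embd)
        self.attn = nn.MultiheadAttention(n_embd, n_head, dropout=dropout, batch_first=True)
        self.ln2 = nn.LayerNorm(n_embd)
        self.mlp = nn.Sequential(
            nn.Linear(n_embd, 4 * n_embd),
            nn.ReLU(),
            nn.Linear(4 * n_embd, n_embd),
            nn.Dropout(dropout),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Causal mask: True entries are blocked, so each position can only
        # attend to itself and earlier positions.
        T = x.size(1)
        mask = torch.triu(torch.ones(T, T, dtype=torch.bool, device=x.device), diagonal=1)
        h = self.ln1(x)
        attn_out, _ = self.attn(h, h, h, attn_mask=mask, need_weights=False)
        x = x + attn_out
        x = x + self.mlp(self.ln2(x))
        return x
```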