This code repository is forked from Meta's llama-recipes and adds finetuning demo code on a public book dataset. The finetuning demo has been tested on a single GPU. This repository provides step-by-step instructions, including how to set up the machine.
A. A high-performance NVIDIA GPU. This can be on your local machine or a cloud instance.
B. Install CUDA and PyTorch. Many cloud services already provide images with CUDA and PyTorch preinstalled; another option is to use a Docker image that includes both. Otherwise, the guidance below assumes you start from a base Ubuntu 22.04 install. a. Install the NVIDIA driver:
# Remove existing Nvidia drivers
sudo apt autoremove nvidia* --purge
# Update Ubuntu before the NVIDIA driver installation
sudo apt update
sudo apt upgrade
# Find your graphics card:
lspci | grep -e 'VGA\|NVIDIA'
# Identify your graphics card and driver recommendation
sudo apt install ubuntu-drivers-common
ubuntu-drivers devices
# Install default driver or specify your preferred version
sudo ubuntu-drivers autoinstall # or sudo apt install nvidia-driver-<preferred version>
# Reboot, then run "nvidia-smi". You should see a table listing the driver version and CUDA version
b. Install Miniconda
mkdir -p ~/miniconda3
wget https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86_64.sh -O ~/miniconda3/miniconda.sh
bash ~/miniconda3/miniconda.sh -b -u -p ~/miniconda3
rm -rf ~/miniconda3/miniconda.sh
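To verify the install before initializing your shell:
# Print the conda version from the fresh install
~/miniconda3/bin/conda --version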
c. Set up conda
conda init bash
source ~/.bashrc
conda create -n myenv
conda activate myenv
d. Install PyTorch
conda install pytorch torchvision torchaudio pytorch-cuda=12.1 -c pytorch -c nvidia
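Once the install finishes, a quick sanity check confirms that PyTorch can see the GPU (run inside the activated conda env):
# Should print True and your GPU's name
python -c "import torch; print(torch.cuda.is_available(), torch.cuda.get_device_name(0))"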
e. (Optional) Install the CUDA toolkit from NVIDIA. The conda PyTorch package above already bundles the CUDA runtime, so a system-wide toolkit is only needed if you compile custom CUDA code (e.g. with nvcc).
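If you do need the standalone toolkit, one option is NVIDIA's apt repository. A minimal sketch for Ubuntu 22.04 (keyring and package names follow NVIDIA's current convention; check the CUDA download page for up-to-date instructions):
# Add NVIDIA's CUDA apt repository keyring
wget https://developer.download.nvidia.com/compute/cuda/repos/ubuntu2204/x86_64/cuda-keyring_1.1-1_all.deb
sudo dpkg -i cuda-keyring_1.1-1_all.deb
sudo apt update
# Match the toolkit version to the pytorch-cuda version installed above
sudo apt install cuda-toolkit-12-1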
C. Install Meta Llama 2 from the link here. Make sure you can run the Meta Llama 2 demo code on your machine.
D. Download the Llama 2 models in Hugging Face format. They can be found here.
E. Prepare the finetuning dataset. The demo dataset can be found at xxx.
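The exact record schema depends on this repo's dataset loader; as a purely hypothetical illustration, an alpaca-style training record (the format upstream llama-recipes supports out of the box) looks like this:
# Hypothetical sample record; check this repo's loader for the actual expected fields
cat > sample-record.json <<'EOF'
[
  {
    "instruction": "Who is the protagonist of the book?",
    "input": "",
    "output": "..."
  }
]
EOF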
Llama-recipes provides a pip distribution for easy installation and usage in other projects. Alternatively, it can be installed from source:
pip install -e .
python -m llama_recipes.finetuning --use_peft --peft_method lora --quantization --model_name <Llama-2-7b-chat-hf dir> --output_dir <output dir>
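The run above uses defaults for everything else; upstream llama-recipes also exposes training knobs such as epochs, batch size, and learning rate as flags. An illustrative variant (the values are examples, not recommendations):
python -m llama_recipes.finetuning --use_peft --peft_method lora --quantization \
    --model_name <Llama-2-7b-chat-hf dir> --output_dir <output dir> \
    --num_epochs 3 --batch_size_training 4 --lr 1e-4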
# Modify your question in llama-recipes-example/examples/books/book-q1.json
python examples/inference.py --model_name <Llama-2-7b-chat-hf dir> --peft_model <finetuning output dir> --quantization
# At the prompt, type the relative or absolute path of book-q1.json.