nano-llm

A lightweight CLI for simple model preprocessing, training, evaluation, and optimization.

Overview

nano-llm is a streamlined command-line interface for simple transformer model development. It provides a complete workflow from data preprocessing to model training, evaluation, and text generation, designed for educational purposes and small-scale experiments.

Key Features

Tokenizer Training: Train custom tokenizers on your text data
Model Training: Train transformer-based language models
Model Evaluation: Evaluate models with perplexity metrics
Text Generation: Generate text using trained models
YAML Configuration: Simple configuration-based approach

TODO Features (Planned Enhancements)

Model Optimization: Implement pruning and distillation techniques
Advanced Positional Encoding: Add Rotary positional encoding (RoPE) for better performance
Efficient Normalization: Replace LayerNorm with RMSNorm for improved efficiency
HuggingFace Integration: Use the transformers package to enable model deployment to HuggingFace Hub

Quick Start

Installation

# Clone the repository
git clone https://github.com/ssubedir/nano-llm.git
cd nano-llm

# Install dependencies
uv sync

Basic Usage

For a complete workflow example, see the Getting Started Guide.

Documentation

For detailed information about commands, configuration, and examples, see the docs folder:

Getting Started - Complete workflow tutorial
Configuration Reference - All configuration options
Command Guides - Detailed command documentation:

Project Structure

nano-llm/
├── app/                   # Core application code
│   ├── cli/               # Command-line interface
│   ├── data/              # Data processing
│   └── model/             # Model architecture
├── configs/               # Configuration files
├── dataset/               # Sample datasets
└── docs/                  # Documentation

Requirements

Python 3.12+
PyTorch
CUDA-compatible GPU (recommended for training)

Development

# Lint all files in the current directory
uvx ruff check

# Format all files in the current directory
uvx ruff format

Contributing

Contributions are welcome!

Fork the repository
Create a feature branch
Make your changes
Run linting and formatting:
```
uvx ruff check
uvx ruff format
```
Submit a pull request

See the TODO section for areas that need work.

License

MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
app		app
configs		configs
dataset		dataset
docs		docs
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
config.yaml		config.yaml
pyproject.toml		pyproject.toml
ruff.toml		ruff.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

nano-llm

Overview

Key Features

TODO Features (Planned Enhancements)

Quick Start

Installation

Basic Usage

Documentation

Project Structure

Requirements

Development

Contributing

License

About

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

nano-llm

Overview

Key Features

TODO Features (Planned Enhancements)

Quick Start

Installation

Basic Usage

Documentation

Project Structure

Requirements

Development

Contributing

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!

Languages