max

MAX framework

MAX is a high-performance inference server that provides an OpenAI-compatible endpoint for large language models (LLMs) and it's a fundamental component of the Modular Platform.

This directory includes the source for our Python-based inference server, Python-based model pipelines (graphs), Python-based neural-net operators (high-level graph ops), Mojo-based kernel functions (low-level graph ops for GPUs and CPUs), and more.

Usage

With just a few commands, you can use MAX to create a local endpoint serving a large language model (LLM) of your choice, using our CLI tool or Docker container. Try it now with our quickstart guide.

Contributing

Thanks for your interest in contributing to MAX!

We welcome contributions to this repo on the main branch. Please first read our Contributor Guide.

If you want to report issues or request features, please create a GitHub issue here—also see our guide to submitting good bug reports.

Contact us

If you'd like to chat with the team and other community members, please send a message to our Discord channel and our forum board.

Name		Name	Last commit message	Last commit date
parent directory ..
compiler/src		compiler/src
docs		docs
examples		examples
include/max/c		include/max/c
kernels		kernels
python		python
tests		tests
BUILD.bazel		BUILD.bazel
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

MAX framework

Usage

Contributing

Contact us

FilesExpand file tree

max

Directory actions

More options

Directory actions

More options

Latest commit

History

max

Folders and files

parent directory

README.md

MAX framework

Usage

Contributing

Contact us