XPU-RT Scheduling and Runtime Integration

Project Description

XPU-RT is an adaptable full-stack end-to-end (E2E) compilation and scheduling flow for efficient mapping of robotic multi-model workloads onto heterogeneous shared-memory SoCs.

This project is under active development. If you would love to contribute or if you find any issues, please do so by opening a pull request or filing an issue on GitHub.

Repository Initialization

git clone https://github.com/ucb-bar/XPU-RT.git
cd XPU-RT
git submodule update --init --recursive

Set up `merlin`

Merlin provides the compiler toolchain, IREE runtime, and cross-compilation support used by XPU-RT. It ships as a git submodule under merlin/.

Prerequisites

Conda (Miniconda or Mamba)
Internet access for initial setup (toolchain downloads, submodule clones)

1) Install Environment

conda env create -f merlin/env_linux.yml
conda activate merlin-dev
uv sync

2) Build compiler tools, target runtime and toolchain

The one-step setup script handles everything (toolchain + host compiler + target runtime):

bash setup.sh

Or run merlin commands individually (from within the merlin/ directory):

cd merlin

# Install SpacemiT cross-compilation toolchain
uv run tools/merlin.py setup toolchain --toolchain-target spacemit

# Build host compiler tools
uv run tools/merlin.py build --profile vanilla --config release

# Build SpacemiT target runtime (includes xpu-rt plugin)
uv run tools/merlin.py build --profile spacemit --config perf

cd ..

See merlin/docs/getting_started.md and merlin/docs/reference/cli.md for the full Merlin CLI reference.

3) Compile models and profile on target

After building merlin, run these from the XPU-RT root:

runtime/scripts/compile_all_models.sh   # compile VMFB files for all models
runtime/scripts/profile_remote.sh       # profile on target device (e.g. BananaPi)

For runtime-specific setup and usage details, see runtime/README.md.

Developer note: using a separate merlin checkout

During active merlin development, you can use a standalone merlin checkout instead of the submodule. Either symlink it:

rm -rf merlin && ln -s /path/to/your/merlin merlin

Or set the MERLIN_DIR environment variable (respected by setup.sh, compile_all_models.sh, and build_runtime.sh):

export MERLIN_DIR=/path/to/your/merlin
bash setup.sh

Config your model parameters

Create data/toplevel/networks_periodic_profile.json if there is none, and add entries like:

"mlp": {
  "id": 1,
  "identifier":            # model name
  "dispatch_deps_path":    # path to model json 
  "period":                # Duration in millisec between excution windows (inverse of frequency)
  "window_duration":       # Duration in millisec for model to finish after window start 
}

Run XPU-RT Scheduler

Run basic demos on top-level network graph:

python scripts/run_xpurt_schedule.py --profiled

The optimal schedule of your workloads on your target will be found in schedules/scheduled_networks_periodic_profiled.json with visualization in 'plots/iree_combined_schedule_period.png' after it finishes.

[Optional] Run Baseline greedy Scheduler

Run greedy scheduler variant:

python scripts/run_greedy_schedule.py --use-grouped

Repository Map

XPU-RT/
├── xpu-rt/                    # Python scheduler core modules
│   ├── scheduler.py
│   ├── workload.py
│   ├── workload_factory.py
│   ├── packing.py
│   ├── plot.py
│   ├── schedule_validation.py
│   └── pytorch_workload/      # Sample model artifacts + dispatch JSON inputs
├── scripts/                   # Python entry points for experiments/scheduling
├── runtime/                   # Scripts for compile/profile flow + optional custom tools
│   ├── scripts/*.sh           #   compile_all_models, profile_remote, etc.
│   └── tools/                 #   Custom tool sources (links merlin's xpu-rt archive)
├── data/                      # Collected benchmark/profile/scheduling outputs
├── merlin/                    # Git submodule (compiler/runtime/tooling upstream)
│   ├── tools/merlin.py        #   Unified CLI: build, compile, setup, benchmark, ...
│   ├── samples/common/xpu-rt/ #   XPU-RT runtime library (baseline + scheduler runners)
│   ├── samples/SpacemiTX60/   #   SpacemiT-specific sample binaries
│   └── models/                #   Model definitions (MLIR/ONNX sources)
├── env.yml                    # Conda environment
└── setup.py                   # Editable pip install config

File-Level Integration Points

runtime/scripts/compile_all_models.sh -> calls merlin/tools/merlin.py compile
runtime/scripts/compile_all_models.sh -> compiles models under merlin/models/...
Pre-built runner binaries come from merlin/build/<profile>/runtime/plugins/merlin-samples/:
- merlin-baseline-async — baseline topo-order dispatch runner
- merlin-dispatch-scheduler — two-cluster scheduled dispatch runner
XPU-RT runtime C API headers: merlin/samples/common/xpu-rt/*.h
Standalone archive for custom tools / Zephyr: merlin/build/<build-name>/runtime/src/iree/runtime/libxpurt_standalone.a

Data/Artifact Flow Between This Repo and `merlin`

runtime/scripts/compile_all_models.sh generates VMFB + graph artifacts into gen/vmfb/... (using Merlin compiler).
runtime/scripts/profile_remote.sh runs topology benchmarks remotely and writes CSV results to gen/profile/....
scripts/run_xpurt_schedule.py reads profiled CSVs from gen/profile/... and combines them with dispatch graph JSON inputs to produce schedules.
Final scheduling outputs and logs are stored under data/... and script output directories.

Notes

The Python scheduler modules are sourced from xpu-rt/*.py and installed via setup.py.
Runtime C tooling in runtime/ is separate from Python scheduling code and is focused on Merlin/IREE integration.
If submodule contents are missing, runtime build/profile scripts will fail early.

Name		Name	Last commit message	Last commit date
Latest commit History 68 Commits
data		data
merlin @ 4582be0		merlin @ 4582be0
plots		plots
qnn_models		qnn_models
runtime		runtime
scripts		scripts
sims		sims
xpu-rt		xpu-rt
zephyr-chipyard-sw @ e23384f		zephyr-chipyard-sw @ e23384f
.contributors-refresh		.contributors-refresh
.gitignore		.gitignore
.gitmodules		.gitmodules
README.md		README.md
env.yml		env.yml
setup.py		setup.py
setup.sh		setup.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

XPU-RT Scheduling and Runtime Integration

Project Description

Repository Initialization

Set up `merlin`

Prerequisites

1) Install Environment

2) Build compiler tools, target runtime and toolchain

3) Compile models and profile on target

Developer note: using a separate merlin checkout

Config your model parameters

Run XPU-RT Scheduler

[Optional] Run Baseline greedy Scheduler

Repository Map

File-Level Integration Points

Data/Artifact Flow Between This Repo and `merlin`

Notes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

XPU-RT Scheduling and Runtime Integration

Project Description

Repository Initialization

Set up merlin

Prerequisites

1) Install Environment

2) Build compiler tools, target runtime and toolchain

3) Compile models and profile on target

Developer note: using a separate merlin checkout

Config your model parameters

Run XPU-RT Scheduler

[Optional] Run Baseline greedy Scheduler

Repository Map

File-Level Integration Points

Data/Artifact Flow Between This Repo and merlin

Notes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Set up `merlin`

Data/Artifact Flow Between This Repo and `merlin`

Packages