
TNT's Not a Transpiler

This is a short demo of using __torch_dispatch__ and __torch_function__ modes to register an Apple MLX backend in PyTorch using Python only.

In this setup, every torch op is actually executed using MLX.
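To make the mechanism concrete, here is a minimal sketch (not code from this repo) of a TorchDispatchMode: while the mode is active, every ATen op call is routed through __torch_dispatch__, which is the hook used to hand execution off to MLX.

```python
# Minimal illustration of the __torch_dispatch__ mode mechanism (not this repo's code).
# While the mode is active, every ATen op goes through __torch_dispatch__, where a
# backend can unwrap the tensors, run its own kernels (MLX here), and wrap the result.
import torch
from torch.utils._python_dispatch import TorchDispatchMode

class LoggingDispatchMode(TorchDispatchMode):
    def __torch_dispatch__(self, func, types, args=(), kwargs=None):
        kwargs = kwargs or {}
        print(f"intercepted ATen op: {func}")  # a real backend would call into MLX here
        return func(*args, **kwargs)           # fall back to PyTorch's own implementation

with LoggingDispatchMode():
    x = torch.randn(2, 2)
    y = x @ x + 1.0  # prints the intercepted ops (aten.randn, aten.mm, aten.add, ...)
```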

This idea is described in https://dev-discuss.pytorch.org/t/embrace-tensor-subclass-as-a-python-device-registration-api/2771 and inspired by https://github.com/albanD/subclass_zoo/blob/main/new_device.py

torchax also uses the same mechanism, but with JAX as the backend.

In this example, we only register the bare minimum needed to run the Llama model. I used a much smaller scale of the model; however, all the operators needed to run the full version (say, 8B) are implemented here.

Rough steps (a sketch combining them follows this list):

  1. Define your payload type (here it is mlx.array).
  2. Tell the Environment how to transform a torch.Tensor into your payload and vice versa.
  3. Register the ATen ops needed to run your model. Each registration implements the logic of an ATen op using MLX ops. LLMs can help a lot here.
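Below is a rough sketch of how these three steps fit together. It is not the actual torch_mlx.py code: the t2m/m2t helpers and the ATEN_TO_MLX table are hypothetical names, and the repo's Environment may organize the conversions and the op registry differently.

```python
# A rough sketch of the three steps above, not the actual torch_mlx.py code.
# The t2m/m2t helpers and the ATEN_TO_MLX table are hypothetical names.
import numpy as np
import torch
import mlx.core as mx

# Step 2: convert between torch.Tensor and the MLX payload (mx.array) and back.
def t2m(t: torch.Tensor) -> mx.array:
    return mx.array(t.detach().cpu().numpy())

def m2t(a: mx.array) -> torch.Tensor:
    return torch.from_numpy(np.array(a))

# Step 3: implement ATen ops in terms of MLX ops and register them in a table.
ATEN_TO_MLX = {
    torch.ops.aten.add.Tensor: mx.add,
    torch.ops.aten.mm.default: mx.matmul,
}

# Simplified dispatch glue: unwrap the tensors, run the MLX kernel, wrap the result.
def run_aten_with_mlx(op, *torch_args):
    mlx_args = [t2m(a) if isinstance(a, torch.Tensor) else a for a in torch_args]
    return m2t(ATEN_TO_MLX[op](*mlx_args))

out = run_aten_with_mlx(torch.ops.aten.mm.default, torch.randn(2, 3), torch.randn(3, 4))
print(out.shape)  # torch.Size([2, 4])
```

In the actual demo, this unwrap/compute/wrap loop is driven by the __torch_dispatch__ mode shown above rather than called by hand.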

Once all of that is done, run it with:

python torch_mlx.py
