ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
-
Updated
Jun 12, 2026 - C++
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Implement a Pytorch-like DL library in C++ from scratch, step by step
A simple neural network inference framework
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
Infrastructure to enable deployment of ML models to low-power resource-constrained embedded targets (including microcontrollers and digital signal processors).
High-performance C++20 neural network framework powered by Intel oneAPI MKL 2025.2. Optimized for CPU-based deep learning inference and training.
AI framework for automatic development of C++ and Python applications
LiteRT continues the legacy of TensorFlow Lite as the trusted, high-performance runtime for on-device AI. Now with LiteRT Next, we're expanding our vision with a new generation of APIs designed for superior performance and simplified hardware acceleration. Discover what's next for on-device AI.
TFLite Support is a toolkit that helps users to develop ML and deploy TFLite models onto mobile / ioT devices.
一款简单易用和高性能的AI部署框架 | An Easy-to-Use and High-Performance AI Deployment Framework
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
ncnn is a high-performance neural network inference framework optimized for the mobile platform
Optimization Framework for Tosa-Dialect (MLIR) based Distributed or NUMA targeted workloads
An Open Source Machine Learning Framework for Everyone
Add a description, image, and links to the ai-framework topic page so that developers can more easily learn about it.
To associate your repository with the ai-framework topic, visit your repo's landing page and select "manage topics."