Skip to content
View Riccorl's full-sized avatar
🦄
Magic
🦄
Magic

Block or report Riccorl

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

Production

91 repositories

😎 A curated list of awesome MLOps tools

Python 4,911 657 Updated Dec 10, 2025

ZenML 🙏: One AI Platform from Pipelines to Agents. https://zenml.io.

Python 5,107 563 Updated Dec 19, 2025

🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools

Python 3,222 611 Updated Dec 19, 2025

ONNX-TensorRT: TensorRT backend for ONNX

C++ 3,176 548 Updated Nov 6, 2025

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

C++ 23,515 5,911 Updated Dec 21, 2025

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 41,022 3,202 Updated Dec 19, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 40,424 7,021 Updated Dec 21, 2025

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 10,151 1,695 Updated Dec 20, 2025

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 12,489 2,294 Updated Dec 11, 2025

Rust wrapper for Microsoft's ONNX Runtime with CUDA support (version 1.7)

Rust 24 6 Updated Jul 3, 2022

Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀

Python 1,689 154 Updated Oct 23, 2024

Transformer related optimization, including BERT, GPT

C++ 6,371 927 Updated Mar 27, 2024

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Jupyter Notebook 14,661 3,390 Updated Aug 12, 2024

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 18,723 3,610 Updated Dec 20, 2025

The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️

Python 3,608 241 Updated May 29, 2025

Edge Inference in Browser with Transformer NLP model

Jupyter Notebook 314 57 Updated Sep 27, 2022

Machine Learning Pipelines for Kubeflow

Python 4,021 1,857 Updated Dec 19, 2025

Exercises and supplementary material for the machine learning operations course at DTU.

Python 739 563 Updated Dec 21, 2025

Fast model deployment on any cloud 🚀

Python 175 30 Updated Feb 25, 2024

Standardized Distributed Generative and Predictive AI Inference Platform for Scalable, Multi-Framework Deployment on Kubernetes

Shell 4,926 1,324 Updated Dec 19, 2025

asyncio (PEP 3156) Redis support

Python 2,304 331 Updated Feb 20, 2023

Serve, optimize and scale PyTorch models in production

Java 4,358 886 Updated Aug 6, 2025

Learn how to design, develop, deploy and iterate on production-grade ML applications.

Jupyter Notebook 45,203 7,068 Updated Aug 18, 2024

A Redis module for serving tensors and executing deep learning graphs

C 841 106 Updated Aug 20, 2025

Python client for RedisAI

Python 88 13 Updated Jul 13, 2023

Accelerated NLP pipelines for fast inference on CPU and GPU. Built with Transformers, Optimum and ONNX Runtime.

Python 126 8 Updated Apr 6, 2022

Deploy a ML inference service on a budget in less than 10 lines of code.

Python 1,345 64 Updated Feb 12, 2024

MLOps Platform

Mustache 272 40 Updated Oct 28, 2024

Kubernetes-friendly ML model management, deployment, and serving.

Go 180 49 Updated Dec 17, 2025

🪄 Turns your machine learning code into microservices with web API, interactive GUI, and more.

Python 3,136 164 Updated Dec 5, 2025