Symbolic shape inference for ONNX

Python 4 Updated Feb 14, 2026

Open ABI and FFI for Machine Learning Systems

C++ 346 60 Updated Feb 16, 2026
Python 111 15 Updated Feb 12, 2026

The JAX version of nano-vllm

Python 1 Updated Jan 2, 2026

Share interesting, entry-level open source projects on GitHub.

Python 143,141 11,117 Updated Jan 28, 2026

An Easy-to-Use and High-Performance AI Deployment Framework

C++ 1,738 212 Updated Feb 15, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,241 1,698 Updated Dec 17, 2025

A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json.

C++ 22 2 Updated Jan 4, 2026

A lightweight, single-header C++11 Jinja2 template engine for LLM chat templates.

C++ 18 5 Updated Jan 4, 2026

A collection of practical, end-to-end AI application examples accelerated by MemryX hardware and software solutions. This repository offers examples for real-time video inference, object detection…

Python 4 1 Updated Jan 29, 2026

JAX bindings for Flash Attention v2

C++ 103 9 Updated Feb 5, 2026

Tokamax: A GPU and TPU kernel library.

Python 172 14 Updated Feb 17, 2026

Customizable Reinforcement Learning

Python 195 16 Updated Feb 12, 2026

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …

Python 1,997 274 Updated Feb 17, 2026

The repository provides code for running inference with the Meta Segment Anything Model 3 (SAM 3).

Python 54 9 Updated Jan 31, 2026

Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels

Python 5,197 452 Updated Feb 16, 2026

SAM 3D Objects

Python 5,990 655 Updated Feb 3, 2026

Ultralytics YOLO 🚀

Python 53,345 10,212 Updated Feb 16, 2026

The best ChatGPT that $100 can buy.

Python 43,488 5,667 Updated Feb 16, 2026

A Toolkit to Help Optimize ONNX Models

Python 426 33 Updated Feb 11, 2026
1 Updated Aug 28, 2025

Multi-stream video inference with Ultralytics YOLO - Display multiple video streams in a grid layout with real-time object detection.

Python 15 3 Updated Dec 30, 2025

🤗 Optimum ONNX: Export your model to ONNX and run inference with ONNX Runtime

Python 118 37 Updated Feb 16, 2026

A high-performance tool for video upscaling, interpolation, depth estimation, and more. Available as a CLI and Adobe Extension.

Python 246 8 Updated Feb 16, 2026

🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools

Python 3,291 618 Updated Feb 9, 2026

Use safetensors with ONNX 🤗

Python 87 5 Updated Feb 12, 2026

MNN TTS demo.

C++ 19 2 Updated May 7, 2025

MNN ASR demo.

C++ 25 2 Updated Mar 24, 2025

LLM deployment project based on ONNX.

C++ 50 8 Updated Oct 9, 2024