Skip to content
View inisis's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report inisis

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A lightweight, single-header C++11 Jinja2 template engine for LLM chat templates.

C++ 14 3 Updated Dec 18, 2025

A collection of practical, end-to-end AI application examples accelerated by MemryX hardware and software solutions. This repository offers examples for real-time video inference, object detection…

Python 3 Updated Dec 13, 2025

JAX bindings for Flash Attention v2

C++ 101 8 Updated Nov 3, 2025

Tokamax: A GPU and TPU kernel library.

Python 136 6 Updated Dec 20, 2025

Customizable Reinforcement Learning

Python 181 16 Updated Dec 21, 2025

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …

Python 1,696 218 Updated Dec 21, 2025

The repository provides code for running inference with the Meta Segment Anything Model 3 (SAM 3).

Python 35 5 Updated Dec 21, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 4,271 350 Updated Dec 22, 2025

SAM 3D Objects

Python 5,022 463 Updated Dec 16, 2025

Ultralytics YOLO πŸš€

Python 50,186 9,691 Updated Dec 22, 2025

The best ChatGPT that $100 can buy.

Python 39,015 4,937 Updated Dec 9, 2025

A Toolkit to Help Optimize Onnx Model

Python 281 26 Updated Dec 15, 2025
1 Updated Aug 28, 2025

Multi-stream video inference with Ultralytics YOLO - Display multiple video streams in a grid layout with real-time object detection.

Python 12 1 Updated Aug 30, 2025

πŸ€— Optimum ONNX: Export your model to ONNX and run inference with ONNX Runtime

Python 102 31 Updated Dec 15, 2025

A high-performance tool for video upscaling, interpolation, depth estimation, and more. Available as a CLI and Adobe Extension.

Python 223 6 Updated Dec 20, 2025

πŸš€ Accelerate inference and training of πŸ€— Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools

Python 3,222 611 Updated Dec 19, 2025

Use safetensors with ONNX πŸ€—

Python 78 5 Updated Oct 1, 2025

mnn tts demo.

C++ 18 2 Updated May 7, 2025

mnn asr demo.

C++ 23 2 Updated Mar 24, 2025

llm deploy project based onnx.

C++ 48 8 Updated Oct 9, 2024

caffe model to onnx

Python 34 12 Updated Nov 16, 2022

YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]

Python 11,152 1,172 Updated Mar 14, 2025

Verifile

Rust 1 Updated Aug 10, 2024

。

TypeScript 44 10 Updated Oct 12, 2025

State-of-the-art Machine Learning for the web. Run πŸ€— Transformers directly in your browser, with no need for a server!

JavaScript 15,115 1,055 Updated Dec 21, 2025

FlagGems is an operator library for large language models implemented in the Triton Language.

Python 805 180 Updated Dec 19, 2025

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…

C++ 13,742 2,141 Updated Dec 22, 2025
Next