kotlin api wrapper for llm-inference chatllm.cpp
-
Updated
Nov 26, 2024 - C
kotlin api wrapper for llm-inference chatllm.cpp
Nim api-wrapper for llm-inference chatllm.cpp
Uniform quantizer that uses mexCallMATLAB to call different MATLAB commands and plot the results
Interfacing the ESP32c3 microcontroller with an ArduCAM-M-2MP Camera Shield 2MP SPI Camera to perform image processing and computer vision
V-lang api wrapper for llm-inference chatllm.cpp
The purpose of this project is to compare different means of computing convolution operation, and see if naive quantiization actually speed ups operation.
Clean C language version of quantizing llama2 model and running quantized llama2 model
A quantized TensorFlow Lite–based real-time object detection system on ESP32-CAM, optimized with the EON™ Compiler for low-latency, low-memory, and portable AI-IoT deployment.
Code and resources for the paper: Real-Time Student Engagement Monitoring on Edge Devices: Deep Learning Meets Efficiency and Privacy
Neural Network C is an advanced neural network implementation in pure C, optimized for high performance on CPUs and NVIDIA GPUs.
Bird Audio Detection using FPGA
Code and resources for the paper: "Cognitive Radio Spectrum Sensing on the Edge: A Quantization-Aware Deep Learning Approach"
off the charts color quantization 🎨
Extremely fast color quantization. Reduce color information of a 24-bit RGB bitmap down to 8-bit.
Color quantization/palette generation for png images
Code for "Characterising Across Stack Optimisations for Deep Convolutional Neural Networks"
The Quantizer - A Swift-based reimplementation of ImageAlpha
Quantized Memory-Augmented Neural Networks (AAAI-18)
Add a description, image, and links to the quantization topic page so that developers can more easily learn about it.
To associate your repository with the quantization topic, visit your repo's landing page and select "manage topics."