A LLaMA2-7B chatbot with memory, running on CPU and optimized using smooth quantization, 4-bit quantization, or Intel® Extension for PyTorch with bfloat16.
Updated Feb 27, 2024 - Python
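The quantization techniques named above (smooth quantization, 4-bit, INT8) all reduce to mapping float values to a narrow integer range with a scale factor. As a minimal, dependency-free illustration of the idea, here is a per-tensor symmetric int8 quantize/dequantize sketch; the function names are illustrative, not an API from any of the listed projects:

```python
def quantize_int8(values):
    """Symmetric per-tensor quantization: map the observed range
    [-amax, amax] onto the int8 range [-127, 127]."""
    amax = max(abs(v) for v in values)
    scale = amax / 127.0 if amax else 1.0
    q = [max(-128, min(127, round(v / scale))) for v in values]
    return q, scale

def dequantize_int8(q, scale):
    # recover approximate float values from int8 codes and the scale
    return [v * scale for v in q]
```

The rounding step is where quantization error comes from; schemes like SmoothQuant work by reshaping activations so this error stays small.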
An INT8 calibrator for an ONNX model with dynamic batch_size at the input and an NMS module at the output. C++ implementation.
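The core job of an INT8 calibrator is to run representative batches through the model, record the activation range, and derive a quantization scale from it. A simplified sketch of that loop is below; the class and method names are hypothetical and this is not the actual TensorRT/ONNX calibrator interface:

```python
class MinimalInt8Calibrator:
    """Tracks the absolute-maximum activation seen over calibration
    batches and derives a symmetric per-tensor int8 scale from it."""

    def __init__(self):
        self.amax = 0.0

    def observe(self, batch):
        # batch: flat list of activation values from one calibration pass
        self.amax = max(self.amax, max(abs(v) for v in batch))

    def scale(self):
        # map the observed range [-amax, amax] onto int8 [-127, 127]
        return self.amax / 127.0 if self.amax else 1.0
```

Real calibrators (e.g. entropy-based ones) pick the range more carefully than a plain max, but the observe-then-derive-scale structure is the same.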
A quantization framework under development.
A VB.NET API wrapper for LLM inference with chatllm.cpp.
A C# API wrapper for LLM inference with chatllm.cpp.
LLM-Lora-PEFT_accumulate explores optimizations for Large Language Models (LLMs) using PEFT, LoRA, and QLoRA. Contribute experiments and implementations to enhance LLM efficiency. Join discussions and push the boundaries of LLM optimization. Let's make LLMs more efficient together!
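LoRA, which the project above builds on, freezes the base weight W and learns only a low-rank update, so the forward pass becomes x @ W plus a cheap rank-r path. A dependency-free sketch of that forward pass follows; all names are illustrative and this is not the PEFT library API:

```python
def matmul(a, b):
    # naive dense matrix multiply, sufficient for this sketch
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def lora_forward(x, W, lora_down, lora_up, alpha, r):
    """y = x @ W + (alpha / r) * (x @ lora_down) @ lora_up

    W is frozen (d_in x d_out); lora_down (d_in x r) and lora_up
    (r x d_out) are the only trained matrices, so the update has rank r."""
    base = matmul(x, W)
    low_rank = matmul(matmul(x, lora_down), lora_up)  # rank-r bottleneck
    s = alpha / r
    return [[base[i][j] + s * low_rank[i][j]
             for j in range(len(base[0]))] for i in range(len(base))]
```

Because only the two small matrices are trained, the optimizer state shrinks dramatically; QLoRA goes further by also quantizing the frozen W to 4-bit.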