Skip to content
View ktho22's full-sized avatar

Organizations

@nota-github @Nota-NetsPresso

Block or report ktho22

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Jupyter Notebook 11 7 Updated Oct 31, 2021

FINE (NeurIPS 2021)

Python 10 7 Updated Nov 22, 2021
Jupyter Notebook 20 7 Updated Oct 12, 2024
Python 10 7 Updated Dec 27, 2023

Neural Network Compression for Edge AI

Python 13 7 Updated Dec 19, 2024
Jupyter Notebook 17 6 Updated Oct 10, 2024

EVF is A web application framework for managing and optimizing deep learning models for edge devices with GPU support and real-time monitoring capabilities.

JavaScript 9 6 Updated Mar 7, 2025

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …

Python 2,004 278 Updated Feb 18, 2026

A 28× Compressed Wav2Lip for Efficient Talking Face Generation [ICCV'23 Demo] [MLSys'23 Workshop] [NVIDIA GTC'23]

Python 61 6 Updated Mar 8, 2024

A library for training, compressing and deploying computer vision models (including ViT) with edge devices

Python 74 12 Updated Sep 29, 2025

Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop]

Python 90 12 Updated Sep 13, 2024

TAO Toolkit deep learning networks with PyTorch backend

Python 107 26 Updated Dec 5, 2025

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Python 1,609 197 Updated Jul 12, 2024

A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!

TypeScript 10,990 712 Updated Apr 23, 2024

The official NetsPresso Python package.

Jupyter Notebook 48 1 Updated Nov 20, 2025

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Python 2,257 193 Updated Mar 27, 2024

A Compressed Stable Diffusion for Efficient Text-to-Image Generation [ECCV'24]

Python 312 19 Updated Jul 6, 2024

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 156,629 32,125 Updated Feb 18, 2026

LLaMa/RWKV onnx models, quantization and testcase

Python 366 29 Updated Jul 6, 2023

A timeline of the latest AI models for audio generation, starting in 2023!

1,913 71 Updated Jan 4, 2024

Development repository for the Triton language and compiler

MLIR 18,445 2,590 Updated Feb 18, 2026

Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE)

Python 2,673 90 Updated Apr 25, 2023

Stable Diffusion with Core ML on Apple Silicon

Python 17,796 1,048 Updated Jul 3, 2025
Python 10 7 Updated Nov 22, 2023

pytorch implementation of "Emotional Voice Conversion using Multitask Learning with Text-to-Speech", Accepted to ICASSP 2020

Python 30 4 Updated Jul 6, 2023
Next