Skip to content
View ktho22's full-sized avatar

Organizations

@nota-github @Nota-NetsPresso

Block or report ktho22

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Jupyter Notebook 11 7 Updated Oct 31, 2021

FINE (NeurIPS 2021)

Python 10 7 Updated Nov 22, 2021
Jupyter Notebook 20 7 Updated Oct 12, 2024
Python 10 7 Updated Dec 27, 2023

Neural Network Compression for Edge AI

Python 13 7 Updated Dec 19, 2024
Jupyter Notebook 17 6 Updated Oct 10, 2024

EVF is A web application framework for managing and optimizing deep learning models for edge devices with GPU support and real-time monitoring capabilities.

JavaScript 9 6 Updated Mar 7, 2025

A unified library of SOTA model optimization techniques like quantization, distillation, pruning, neural architecture search, speculative decoding, etc. It compresses deep learning models for downs…

Python 2,963 453 Updated Jun 22, 2026

A 28× Compressed Wav2Lip for Efficient Talking Face Generation [ICCV'23 Demo] [MLSys'23 Workshop] [NVIDIA GTC'23]

Python 60 6 Updated Mar 8, 2024

A library for training, compressing and deploying computer vision models (including ViT) with edge devices

Python 74 11 Updated Sep 29, 2025

Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop]

Python 90 13 Updated Sep 13, 2024

TAO Toolkit deep learning networks with PyTorch backend

Python 112 27 Updated Jun 19, 2026

[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Python 1,661 206 Updated Jul 12, 2024

A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!

TypeScript 10,945 705 Updated Apr 23, 2024

The official NetsPresso Python package.

Jupyter Notebook 48 1 Updated Nov 20, 2025

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Python 2,323 201 Updated Mar 27, 2024

A Compressed Stable Diffusion for Efficient Text-to-Image Generation [ECCV'24]

Python 318 20 Updated Jul 6, 2024

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 161,786 33,566 Updated Jun 22, 2026

LLaMa/RWKV onnx models, quantization and testcase

Python 368 29 Updated Jul 6, 2023

A timeline of the latest AI models for audio generation, starting in 2023!

1,910 71 Updated Jan 4, 2024

Development repository for the Triton language and compiler

MLIR 19,496 2,952 Updated Jun 22, 2026

Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE)

Python 2,722 94 Updated Apr 25, 2023

Stable Diffusion with Core ML on Apple Silicon

Python 17,912 1,060 Updated Jul 3, 2025
Python 10 7 Updated Nov 22, 2023

pytorch implementation of "Emotional Voice Conversion using Multitask Learning with Text-to-Speech", Accepted to ICASSP 2020

Python 30 4 Updated Jul 6, 2023
Next