Skip to content
View itsucks's full-sized avatar

Block or report itsucks

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
38 stars written in Python
Clear filter

Easy and Efficient Transformer : Scalable Inference Solution For Large NLP model

Python 264 44 Updated Nov 30, 2024

[ICML'21 Oral] I-BERT: Integer-only BERT Quantization

Python 260 42 Updated Jan 29, 2023

This repository contains the code for "Generating Datasets with Pretrained Language Models".

Python 189 24 Updated Aug 17, 2021

Block-sparse primitives for PyTorch

Python 160 23 Updated Apr 5, 2021
Python 44 9 Updated Jul 14, 2021

Implementation of ICLR 2018 paper "Loss-aware Weight Quantization of Deep Networks"

Python 26 4 Updated Oct 24, 2019

Implementation of ICLR 2017 paper "Loss-aware Binarization of Deep Networks"

Python 18 7 Updated Feb 24, 2019