Skip to content
View cyx0406's full-sized avatar

Block or report cyx0406

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

Python 2,287 195 Updated Mar 27, 2024
Python 29 Updated Mar 17, 2026

All parts of Claude Code's system prompt, 24 builtin tool descriptions, sub agent prompts (Plan/Explore/Task), utility prompts (CLAUDE.md, compact, statusline, magic docs, WebFetch, Bash cmd, secur…

JavaScript 8,541 1,580 Updated Apr 9, 2026

LaTeX Thesis Template for Tsinghua University

TeX 5,246 1,144 Updated Apr 4, 2026

Repository for StableQAT

Python 8 Updated Apr 8, 2026

ReActNet: Towards Precise Binary NeuralNetwork with Generalized Activation Functions. In ECCV 2020.

Python 264 43 Updated Nov 11, 2021

Quartet II Official Code

Python 66 8 Updated Mar 23, 2026

PyTorch building blocks for the OLMo ecosystem

Python 1,130 224 Updated Apr 9, 2026

An unnecessarily tiny implementation of GPT-2 in NumPy.

Python 3,459 456 Updated Apr 24, 2023

Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.

Python 57 2 Updated Mar 12, 2026

Code for the papers: “Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling” and “Adaptive Block-Scaled Data Types”

Python 165 16 Updated Apr 7, 2026
Jupyter Notebook 120 12 Updated Mar 18, 2026

Spectral Sphere Optimizer

Python 114 2 Updated Mar 23, 2026

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 4,256 310 Updated Jan 14, 2026
Python 1 1 Updated Nov 11, 2025

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 3,451 249 Updated Apr 8, 2026
Python 4 1 Updated Feb 13, 2021

[ICLR25] STBLLM: Breaking the 1-Bit Barrier with Structured Binary LLMs

Python 20 2 Updated Jun 3, 2025

同时兼容 Mac 和 Windows 的常用字体

288 30 Updated May 19, 2017

QuTLASS: CUTLASS-Powered Quantized BLAS for Deep Learning

C++ 175 21 Updated Nov 11, 2025

GPU documentation for humans

Python 558 69 Updated Mar 24, 2026

[ACL 2025] Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models

Python 36 4 Updated Nov 4, 2025

SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse–Linear Attention

Python 294 18 Updated Feb 24, 2026

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

Python 3,265 690 Updated Apr 9, 2026

PyTorch bindings for CUTLASS grouped GEMM.

Cuda 149 86 Updated May 29, 2025

A framework to compare low-bit integer and float-point formats

Python 71 7 Updated Feb 6, 2026

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 6,789 392 Updated Mar 27, 2026
Next