Skip to content
View terarachang's full-sized avatar

Block or report terarachang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs

Python 180 15 Updated Sep 23, 2025

LLM KV cache compression made easy

Python 729 83 Updated Dec 15, 2025

[COLM 2025] Official PyTorch implementation of "Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models"

Python 61 5 Updated Jul 8, 2025

A series of technical report on Slow Thinking with LLM

Python 753 41 Updated Aug 13, 2025

Code repo for the paper "LLM-QAT Data-Free Quantization Aware Training for Large Language Models"

Python 322 25 Updated Mar 4, 2025

[ICML 2024] BiLLM: Pushing the Limit of Post-Training Quantization for LLMs

Python 227 16 Updated Jan 11, 2025

Code repo for the paper "SpinQuant LLM quantization with learned rotations"

Python 362 64 Updated Feb 14, 2025

The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models.

Python 180 20 Updated May 13, 2022

A library for finding knowledge neurons in pretrained transformer models.

Python 158 19 Updated Feb 13, 2022

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,332 2,133 Updated Dec 18, 2025

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,263 4,030 Updated Jul 17, 2024

Tensors, for human consumption

Jupyter Notebook 1,337 22 Updated Nov 17, 2025

A framework for few-shot evaluation of language models.

Python 11,020 2,922 Updated Dec 23, 2025
Python 64 5 Updated Nov 28, 2022

[ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"

Python 109 16 Updated Jul 15, 2023

Ream: A paper manager

HTML 4 2 Updated Oct 28, 2023

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

Python 8,162 1,339 Updated Jul 23, 2024

Code for paper "UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning", ACL 2022

Python 63 5 Updated Mar 23, 2022

Performance Prediction for NLP Tasks

Jupyter Notebook 17 4 Updated May 5, 2020

Code for TACL 2020 paper "An Empirical Study on Robustness to Spurious Correlations using Pre-trained Language Models"

Python 14 Updated Jul 31, 2020

Implementation of paper "Towards a Unified View of Parameter-Efficient Transfer Learning" (ICLR 2022)

Python 541 43 Updated Mar 24, 2022

Bilinear attention networks for visual question answering

Python 548 99 Updated Oct 30, 2023

Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"

Python 1,516 227 Updated Apr 3, 2024

Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language"

Python 539 102 Updated May 1, 2023

Beyond Accuracy: Behavioral Testing of NLP models with CheckList

Jupyter Notebook 2,048 210 Updated Jan 9, 2024

Elegant PyTorch implementation of paper Model-Agnostic Meta-Learning (MAML)

Python 2,469 439 Updated May 16, 2019

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 32,046 6,639 Updated Sep 30, 2025
Next