Kai Yi WilliamYi96

Research Scientist @ Meta, CA. Focusing on model compression and inference acceleration.

86 followers · 1 following

Meta Platforms, Inc
Sunnyvale, CA
21:09 (UTC -07:00)
https://kaiyi.me

Lists (1)

OPT

Stars

facebookresearch / WinQ

WinQ Accelerating Quantization-Aware Training for LLMs around Saddle Points

Python 5 Updated Nov 15, 2025

WilliamYi96 / FedComLoc

FedComLoc: Communication-Efficient Distributed Training of Sparse and Quantized Models. TMLR, 2025.

Python 1 Updated Sep 16, 2025

pprp / Pruner-Zero

[ICML24] Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for LLMs

Python 100 11 Updated Nov 25, 2024

alirezadir / Machine-Learning-Interviews

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Jupyter Notebook 8,272 1,473 Updated Nov 28, 2025

chiphuyen / dmls-book

Summaries and resources for Designing Machine Learning Systems book (Chip Huyen, O'Reilly 2022)

4,794 952 Updated Oct 31, 2025

yangshun / tech-interview-handbook

Curated coding interview preparation materials for busy software engineers

TypeScript 139,600 16,582 Updated Apr 5, 2026

zixian2021 / AI-interview-cards

The most complete repository of AI algorithm interview questions: 1,000 questions across 25 categories

1,342 120 Updated Aug 13, 2023

SonyResearch / FedP3

Implementation of the paper: "FedP3: Federated Personalized and Privacy-friendly Network Pruning under Model Heterogeneity" (ICLR 2024)

Python 7 1 Updated Jun 27, 2024

facebookresearch / schedule_free

Schedule-Free Optimization in PyTorch

Python 2,278 76 Updated May 21, 2025

Vahe1994 / AQLM

Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf and PV-Tuning: Beyond Straight-Through Estimation for Ext…

Python 1,320 194 Updated Feb 26, 2026

jiaweizzhao / GaLore

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Python 1,693 167 Updated Oct 28, 2024

kyegomez / BitNet

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch

Python 1,933 172 Updated Apr 27, 2026

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 22,129 2,699 Updated Jan 23, 2026

WilliamYi96 / EF-BV

EF-BV: A Unified Theory of Error Feedback and Variance Reduction Mechanisms for Biased and Unbiased Compression in Distributed Optimization. NeurIPS, 2022

Python 3 Updated Jan 24, 2024

WilliamYi96 / VR-ProxSkip

Variance Reduced ProxSkip: Algorithm, Theory and Application to Federated Learning. NeurIPS, 2022

Jupyter Notebook 3 Updated Jan 24, 2024

ml-explore / mlx-examples

Examples in the MLX framework

Python 8,621 1,163 Updated Apr 6, 2026

jzhang38 / TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,963 616 Updated May 3, 2024

S-LoRA / S-LoRA

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Python 1,913 124 Updated Jan 21, 2024

pjlab-sys4nlp / llama-moe

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)

Python 1,001 60 Updated Dec 6, 2024

llm-eff / FedPepTAO

Python 30 4 Updated Mar 4, 2024

tloen / alpaca-lora

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,927 2,190 Updated Jul 29, 2024

lm-sys / FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,473 4,794 Updated May 1, 2026

microsoft / LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 13,533 900 Updated Dec 17, 2024

VainF / Torch-Pruning

[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.

Python 3,308 382 Updated Sep 7, 2025

horseee / LLM-Pruner

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.

Python 1,127 132 Updated Oct 7, 2024

RUCAIBox / LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 12,159 939 Updated Mar 11, 2025

SMILELab-FL / FedLab

A flexible Federated Learning Framework based on PyTorch, simplifying your Federated Learning research.

Jupyter Notebook 825 144 Updated Oct 20, 2025

weigq / iclr2023_stats

ICLR2023 statistics

HTML 59 2 Updated Nov 11, 2023

yaronn / blessed-contrib

Build terminal dashboards using ascii/ansi art and javascript

JavaScript 15,736 838 Updated May 1, 2026

wx-zhang / IGCZSL

Python 9 Updated Jan 23, 2024
