View WilliamYi96's full-sized avatar
:atom:
Focusing

WinQ: Accelerating Quantization-Aware Training for LLMs around Saddle Points

Python 5 Updated Nov 15, 2025

FedComLoc: Communication-Efficient Distributed Training of Sparse and Quantized Models. TMLR, 2025.

Python 1 Updated Sep 16, 2025

[ICML24] Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for LLMs

Python 100 11 Updated Nov 25, 2024

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Jupyter Notebook 8,272 1,473 Updated Nov 28, 2025

Summaries and resources for Designing Machine Learning Systems book (Chip Huyen, O'Reilly 2022)

4,794 952 Updated Oct 31, 2025

Curated coding interview preparation materials for busy software engineers

TypeScript 139,600 16,582 Updated Apr 5, 2026

The most complete repository of AI algorithm interview questions: 1,000 questions across 25 categories

1,342 120 Updated Aug 13, 2023

Implementation of the paper: "FedP3: Federated Personalized and Privacy-friendly Network Pruning under Model Heterogeneity" (ICLR 2024)

Python 7 1 Updated Jun 27, 2024

Schedule-Free Optimization in PyTorch

Python 2,278 76 Updated May 21, 2025

Official PyTorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf and PV-Tuning: Beyond Straight-Through Estimation for Ext…

Python 1,320 194 Updated Feb 26, 2026

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Python 1,693 167 Updated Oct 28, 2024

Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in PyTorch

Python 1,933 172 Updated Apr 27, 2026

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 22,129 2,699 Updated Jan 23, 2026

EF-BV: A Unified Theory of Error Feedback and Variance Reduction Mechanisms for Biased and Unbiased Compression in Distributed Optimization. NeurIPS, 2022

Python 3 Updated Jan 24, 2024

Variance Reduced ProxSkip: Algorithm, Theory and Application to Federated Learning. NeurIPS, 2022

Jupyter Notebook 3 Updated Jan 24, 2024

Examples in the MLX framework

Python 8,621 1,163 Updated Apr 6, 2026

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,963 616 Updated May 3, 2024

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Python 1,913 124 Updated Jan 21, 2024

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)

Python 1,001 60 Updated Dec 6, 2024
Python 30 4 Updated Mar 4, 2024

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,927 2,190 Updated Jul 29, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,473 4,794 Updated May 1, 2026

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 13,533 900 Updated Dec 17, 2024

[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.

Python 3,308 382 Updated Sep 7, 2025

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.

Python 1,127 132 Updated Oct 7, 2024

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 12,159 939 Updated Mar 11, 2025

A flexible Federated Learning Framework based on PyTorch, simplifying your Federated Learning research.

Jupyter Notebook 825 144 Updated Oct 20, 2025

ICLR 2023 statistics

HTML 59 2 Updated Nov 11, 2023

Build terminal dashboards using ascii/ansi art and javascript

JavaScript 15,736 838 Updated May 1, 2026
Python 9 Updated Jan 23, 2024