Skip to content
View gru1's full-sized avatar

Block or report gru1

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Knowledge DIstillation for LLMs

Jupyter Notebook 1 1 Updated Feb 24, 2026

AmneziaWG installer

Shell 275 16 Updated Mar 29, 2026

A coding-free framework built on PyTorch for reproducible deep learning studies. PyTorch Ecosystem. 🏆26 knowledge distillation methods presented at TPAMI, CVPR, ICLR, ECCV, NeurIPS, ICCV, AAAI, etc…

Python 1,605 143 Updated Dec 24, 2025

GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Python 2,242 160 Updated Mar 29, 2026

A user-friendly & efficient knowledge distillation framework for LLMs, supporting off-policy, on-policy (OPD), cross-tokenizer, multimodal, and on-policy self-distillation.

Python 73 6 Updated Mar 27, 2026

Pytorch implementation of various Knowledge Distillation (KD) methods.

Python 1,745 268 Updated Nov 25, 2021

Multi-Teacher Knowledge Distillation, code for my PhD dissertation. I used knowledge distillation as a decision-fusion and compressing mechanism for ensemble models.

Jupyter Notebook 28 3 Updated May 19, 2023

A pipeline for LLM knowledge distillation

Python 113 14 Updated Mar 23, 2026

Qwen3.5 is the large language model series developed by Qwen team, Alibaba Cloud.

2,357 128 Updated Mar 2, 2026

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Python 29,186 3,545 Updated Dec 5, 2025

Visual Causal Flow

Python 2,623 216 Updated Feb 3, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs (Windows build & kernels)

Python 365 37 Updated Mar 26, 2026

An open-source implementaion for fine-tuning Qwen-VL series by Alibaba Cloud.

Python 1,775 208 Updated Mar 25, 2026

A minimal PyTorch re-implementation of Qwen 3.5

Python 393 27 Updated Mar 5, 2026

Contexts Optical Compression

Python 22,763 2,093 Updated Jan 27, 2026

[ICCV 2025] Official implementation of LLaVA-KD: A Framework of Distilling Multimodal Large Language Models

Python 126 8 Updated Oct 14, 2025

Official codebase for the paper LaViT

Python 27 Updated Feb 15, 2026

Code for 'Three Minds, One Student: Online Multi-Teacher Knowledge Distillation for Multimodal Recommendation'

Python 1 Updated Jan 29, 2026

A curated list of awesome papers on dataset distillation and related applications.

HTML 1,915 171 Updated Mar 28, 2026

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …

Python 2,262 319 Updated Mar 29, 2026

Images to inference with no labeling (use foundation models to train supervised models).

Python 2,654 213 Updated May 14, 2025

a toolkit on knowledge distillation for large language models

Python 302 33 Updated Mar 10, 2026

Awesome Knowledge Distillation

3,830 512 Updated Mar 22, 2026

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,857 2,232 Updated Mar 27, 2026

Benchmarking Vision-Language Models on OCR tasks in Dynamic Video Environments

Python 47 4 Updated Feb 14, 2025

vision language models finetuning notebooks & use cases (Medgemma - paligemma - florence .....)

Jupyter Notebook 61 13 Updated Oct 7, 2025

Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!

Python 8,920 706 Updated Mar 29, 2026

[COLM'25] Official implementation of the Law of Vision Representation in MLLMs

Python 176 8 Updated Oct 6, 2025
Next