Starred repositories

Knowledge Distillation for LLMs

Jupyter Notebook 1 1 Updated Feb 24, 2026

AmneziaWG installer

Rust 339 18 Updated Apr 2, 2026

A coding-free framework built on PyTorch for reproducible deep learning studies. PyTorch Ecosystem. 🏆26 knowledge distillation methods presented at TPAMI, CVPR, ICLR, ECCV, NeurIPS, ICCV, AAAI, etc…

Python 1,606 143 Updated Mar 31, 2026

GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Python 2,252 160 Updated Apr 1, 2026

A user-friendly & efficient knowledge distillation framework for LLMs, supporting off-policy, on-policy (OPD), cross-tokenizer, multimodal, and on-policy self-distillation.

Python 76 6 Updated Apr 2, 2026

PyTorch implementation of various Knowledge Distillation (KD) methods.

Python 1,745 267 Updated Nov 25, 2021

Multi-Teacher Knowledge Distillation, code for my PhD dissertation. I used knowledge distillation as a decision-fusion and compressing mechanism for ensemble models.

Jupyter Notebook 28 3 Updated May 19, 2023

A pipeline for LLM knowledge distillation

Python 113 14 Updated Mar 23, 2026

Qwen3.5 is the large language model series developed by Qwen team, Alibaba Cloud.

2,435 129 Updated Mar 2, 2026

Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

Python 29,213 3,546 Updated Dec 5, 2025

Visual Causal Flow

Python 2,649 218 Updated Feb 3, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs (Windows build & kernels)

Python 375 37 Updated Mar 26, 2026

An open-source implementation for fine-tuning the Qwen-VL series by Alibaba Cloud.

Python 1,791 209 Updated Mar 25, 2026

A minimal PyTorch re-implementation of Qwen 3.5

Python 397 28 Updated Mar 5, 2026

Contexts Optical Compression

Python 22,774 2,095 Updated Jan 27, 2026

[ICCV 2025] Official implementation of LLaVA-KD: A Framework of Distilling Multimodal Large Language Models

Python 127 8 Updated Oct 14, 2025

Official codebase for the paper LaViT

Python 27 Updated Feb 15, 2026

Code for 'Three Minds, One Student: Online Multi-Teacher Knowledge Distillation for Multimodal Recommendation'

Python 1 Updated Jan 29, 2026

A curated list of awesome papers on dataset distillation and related applications.

HTML 1,918 172 Updated Apr 2, 2026

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …

Python 2,336 330 Updated Apr 2, 2026

Images to inference with no labeling (use foundation models to train supervised models).

Python 2,656 215 Updated May 14, 2025

A toolkit for knowledge distillation of large language models

Python 302 33 Updated Mar 10, 2026

Awesome Knowledge Distillation

3,833 512 Updated Mar 22, 2026

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,877 2,234 Updated Apr 2, 2026

Benchmarking Vision-Language Models on OCR tasks in Dynamic Video Environments

Python 47 4 Updated Feb 14, 2025

Vision-language model fine-tuning notebooks and use cases (MedGemma, PaliGemma, Florence, ...)

Jupyter Notebook 61 13 Updated Oct 7, 2025

Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!

Python 9,101 734 Updated Apr 2, 2026

[COLM'25] Official implementation of the Law of Vision Representation in MLLMs

Python 176 8 Updated Oct 6, 2025