A cutting-edge collection and survey of vision-language model papers and model GitHub repositories. Continuously updated.

HTML 623 36 Updated Jun 3, 2026

[ICML 2026] Gated Relational Alignment via Confidence-based Distillation for Efficient VLMs

Jupyter Notebook 22 2 Updated Jun 4, 2026

Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

Python 4,219 721 Updated Jun 12, 2026

A framework for few-shot evaluation of language models.

Python 12,943 3,335 Updated Jun 2, 2026

Qwen3-VL is the multimodal large language model series developed by the Qwen team, Alibaba Cloud.

Jupyter Notebook 19,372 1,786 Updated Jan 30, 2026

Code repo for the paper "SpinQuant: LLM Quantization with Learned Rotations"

Python 402 90 Updated Feb 14, 2025

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

Python 899 82 Updated Nov 26, 2025

[ECCV 2024] ShareGPT4V: Improving Large Multi-modal Models with Better Captions

Python 258 8 Updated Jul 1, 2024

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 3,560 317 Updated Jul 17, 2025

[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs, and video generative models.

Python 724 79 Updated May 14, 2026

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 4,224 603 Updated Jun 11, 2026

A SOTA quantization algorithm for high-accuracy low-bit LLM inference, seamlessly optimized for CPU/XPU/CUDA, with multi-datatype support and full compatibility with vLLM, SGLang, and Transformers.

Python 1,451 140 Updated Jun 13, 2026

[NeurIPS 2025] PhysioWave: A Multi-Scale Wavelet-Transformer for Physiological Signal Representation

Python 187 25 Updated Oct 20, 2025

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.

Python 2,341 304 Updated May 11, 2025

[NER 2025 Spotlight] WaveFormer: A Lightweight Transformer Model for sEMG-based Gesture Recognition

Python 111 15 Updated Oct 24, 2025