LLM/VLM
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA), built toward GPT-4V-level capabilities and beyond.
EVA Series: Visual Representation Fantasies from BAAI
Official implementation of "Why are Visually-Grounded Language Models Bad at Image Classification?" (NeurIPS 2024)
Use PEFT or full-parameter training for CPT/SFT/DPO/GRPO of 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, LLaVA, …)
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
SuperCLUE: A comprehensive benchmark for general-purpose foundation models in Chinese
Qwen3-VL is the multimodal large language model series developed by the Qwen team at Alibaba Cloud.
✨✨Latest Advances on Multimodal Large Language Models
✨✨ [ICLR 2025] MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?
Awesome-LLM: a curated list of Large Language Models
A collection of available inference solutions for LLMs
✨✨[NeurIPS 2025] VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
The official repo of Qwen-VL (通义千问-VL), the chat & pretrained large vision-language model proposed by Alibaba Cloud.
Famous Vision Language Models and Their Architectures
Collection of AWESOME vision-language models for vision tasks
[T-IV] This repository collects research papers on large vision-language models in autonomous driving and intelligent transportation systems. The repository is continuously updated.
Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment
A book on the foundations of LLMs
R1-Onevision, a vision-language model capable of deep CoT reasoning.
Online playground for OpenAI tokenizers
This repository provides a valuable reference for researchers in multimodality; start your exploration of RL-based reasoning MLLMs here!
Eagle: Frontier Vision-Language Models with Data-Centric Strategies
The Next Step Forward in Multimodal LLM Alignment
Modeling, training, eval, and inference code for OLMo
Build, evaluate, and train general multi-agent assistants with ease
The simplest, fastest repository for training/finetuning small VLMs.