Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…

Python 10,966 956 Updated Nov 10, 2025

om-ai-lab / VLM-R1

Solve Visual Understanding with Reinforced VLMs

Python 5,683 366 Updated Oct 21, 2025

StarsfieldAI / R1-V

Witness the aha moment of VLM with less than $3.

Python 3,980 290 Updated May 19, 2025

EvolvingLMMs-Lab / lmms-eval

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,269 421 Updated Nov 10, 2025

Gen-Verse / MMaDA

[NeurIPS 2025] MMaDA - Open-Sourced Multimodal Large Diffusion Language Models

Python 1,478 71 Updated Oct 13, 2025

ShengranHu / ADAS

[ICLR 2025] Automated Design of Agentic Systems

Python 1,452 224 Updated Jan 28, 2025

metauto-ai / GPTSwarm

🐝 When Agent Meets RL and Prompt Optimization the First Time

Python 963 83 Updated Jan 3, 2025

42Shawn / LLaVA-PruMerge

LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models

Python 155 9 Updated Sep 27, 2025

FasterDecoding / TEAL

Python 147 11 Updated Feb 15, 2025

tongjingqi / Game-RL

Game-RL: Synthesizing Multimodal Verifiable Game Data to Boost VLMs' General Reasoning

Python 104 2 Updated Oct 16, 2025

wangqinsi1 / Vision-Zero

This is the official Python version of Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play.

Python 98 2 Updated Oct 21, 2025

linyueqian / VERA

Python 88 1 Updated Nov 4, 2025

wangqinsi1 / GAINRL

This is the official Python version of Angles Don’t Lie: Unlocking Training-Efficient RL Through the Model’s Own Signals.

Python 78 9 Updated Sep 26, 2025

showlab / Impossible-Videos

ICML 2025 - Impossible Videos

Python 78 8 Updated Jul 23, 2025

Alpha-Innovator / AdaptiveDiffusion

[NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy

Python 72 5 Updated Jan 22, 2025

FastMAS / KVCOMM

[NeurIPS'25] KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems

Python 55 6 Updated Nov 3, 2025

wangqinsi1 / Dobi-SVD

Official code implementation for the ICLR 2025 accepted paper "Dobi-SVD: Differentiable SVD for LLM Compression and Some New Perspectives"

Python 47 6 Updated Oct 19, 2025

wangqinsi1 / MathNAS

This is the official PyTorch implementation of 2023-NeurIPS-MathNAS: If Blocks Have a Role in Mathematical Architecture Design.

Python 36 2 Updated Apr 10, 2024

Ting-Justin-Jiang / sada-icml

[ICML 2025] Official repo for Stability-guided Adaptive Diffusion Acceleration. 🚀🌙 Accelerating off-the-shelf diffusion models with a unified stability criterion.

Python 32 4 Updated Jul 24, 2025

dnhkng / PCAonGPU

A GPU-based Incremental PCA implementation.

Python 31 6 Updated Feb 18, 2025

mkantwala / DeepSeek-R1-TrainingSuite

Advanced implementation of DeepSeek-R1 featuring Group Relative Policy Optimization (GRPO) for mathematical reasoning AI. Integrates safe distillation, modular reward systems, and efficient LoRA fi…

Python 13 3 Updated Jan 29, 2025

linyueqian / HippoMM

HippoMM: Hippocampal-inspired Multimodal Memory

Python 13 Updated May 22, 2025

wangqinsi1 / 2025-ICML-CoreMatching

This is the official PyTorch implementation of 2025-ICML-CoreMatching: Co-adaptive Sparse Inference Framework for Comprehensive Acceleration of Vision Language Model

Python 12 2 Updated May 27, 2025

UMich-CURLY / LatentBKI

Repository for latent Bayesian Kernel Inference

Python 7 1 Updated Apr 1, 2025

wangqinsi1 / DGL

This is the official PyTorch implementation of 2023-TMC-DGL: Device Generic Latency model for Neural Architecture Search.

Python 4 Updated Nov 19, 2023


Wang Qinsi wangqinsi1
