Stars
[NeurIPS'25] KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems
Game-RL: Synthesizing Multimodal Verifiable Game Data to Boost VLMs' General Reasoning
🐝 When Agents Meet RL and Prompt Optimization for the First Time
[Survey] A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems
This is the official Python version of Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play.
[Up-to-date] Large Language Model Agent: A Survey on Methodology, Applications and Challenges
[ICML 2025] Official repo for Stability-guided Adaptive Diffusion Acceleration. 🚀🌙 Accelerating off-the-shelf diffusion models with a unified stability criterion.
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…
Solve Visual Understanding with Reinforced VLMs
This is the official Python version of Angles Don’t Lie: Unlocking Training-Efficient RL Through the Model’s Own Signals.
This is the official PyTorch implementation for the ICML 2025 paper CoreMatching: Co-adaptive Sparse Inference Framework for Comprehensive Acceleration of Vision Language Models.
[NeurIPS 2025] MMaDA - Open-Source Multimodal Large Diffusion Language Models
Democratizing Reinforcement Learning for LLMs
An LLM agent that conducts deep research (local and web) on any given topic and generates a long report with citations.
Witness the aha moment of a VLM for less than $3.
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
Fully open reproduction of DeepSeek-R1
Advanced implementation of DeepSeek-R1 featuring Group Relative Policy Optimization (GRPO) for mathematical reasoning AI. Integrates safe distillation, modular reward systems, and efficient LoRA fi…
Official code implementation for the ICLR 2025 accepted paper "Dobi-SVD: Differentiable SVD for LLM Compression and Some New Perspectives"
LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models
[NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy
This is the official Python version of CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Activation.
Repository for latent Bayesian Kernel Inference
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.