Stars
[CVPR 2026] BiGain is a training-free framework for accelerating diffusion models while preserving generation quality and improving classification.
Official implementation of paper: From Masks to Pixels and Meaning: A New Taxonomy, Benchmark, and Metrics for VLM Image Tampering
Official implementation of paper: Pushing the Frontier of Black-Box LVLM Attacks via Fine-Grained Detail Targeting
Official code for our paper "Sink-Aware Pruning for Diffusion Language Models"
A defense framework against MLLM-based web GUI agents. This repository provides both the generative CAPTCHA system and tools for evaluating agent resistance.
[ICLR 2026 🔥] Official PyTorch implementation for "Attention Is All You Need for KV Cache in Diffusion LLMs"
Hard Labels In! Rethinking the Role of Hard Labels in Mitigating Local Semantic Drift
The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".
Official code of the paper "VideoMolmo: Spatio-Temporal Grounding meets Pointing"
[ICLR 2026] Optimization-free Dataset Distillation for Object Detection. Paper at: https://arxiv.org/abs/2506.01942
(ACL 2025 Main) Distilling RAG for SLMs from LLMs to Transfer Knowledge and Mitigate Hallucination via Evidence and Graph-based Distillation
[CVPR 2026 🔥] Time Blindness: Why Video-Language Models Can't See What Humans Can?
[NeurIPS 2025] The first web-based benchmark and platform to evaluate visual reasoning and interaction capabilities of MLLM powered agents through diverse and dynamic CAPTCHA puzzles.
[NeurIPS25 & ICML25 Workshop on Reliable and Responsible Foundation Models] A Simple Baseline Achieving Over 90% Success Rate Against the Strong Black-box Models of GPT-4.5/4o/o1. Paper at: https:/…
[ACL 2025 🔥] A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding
Mobile-MMLU: A Mobile Intelligence Language Understanding Benchmark. Paper at: https://arxiv.org/abs/2503.20786
(CVPR 2025) Official implementation of DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation, which outperforms SOTA top-1 accuracy by +1.3% and increases per-class diversity by +5%
Official inference framework for 1-bit LLMs
[ICLR 2025] γ-MOD: Mixture-of-Depth Adaptation for Multimodal Large Language Models
Semantics-Aware Patch Encoding and Hierarchical Dependency Modeling for Long-Term Time Series Forecasting
FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation
Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs
Open-LLM-Leaderboard: Open-Style Question Evaluation. Paper at https://arxiv.org/abs/2406.07545
Prompt Engineering at Your Fingertips!
Prompt Builder is a small Python application that implements the principles outlined in the paper "Principled Instructions Are All You Need for Questioning LLaMA-1/2, GPT-3.5/4". It allows users to…
A principled instruction benchmark on formulating effective queries and prompts for large language models (LLMs). Our paper: https://arxiv.org/abs/2312.16171