Skip to content
View szq0214's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Highlights

  • Pro

Organizations

@VILA-Lab

Block or report szq0214

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[CVPR 2026] BiGain is a training-free framework for accelerating diffusion models while preserving generation quality and improving classification.

Python 10 Updated Mar 19, 2026

Official implementation of paper: From Masks to Pixels and Meaning: A New Taxonomy, Benchmark, and Metrics for VLM Image Tampering

Python 12 1 Updated Mar 27, 2026

Official implementation of paper: Pushing the Frontier of Black-Box LVLM Attacks via Fine-Grained Detail Targeting

Python 9 Updated Feb 20, 2026

Official code for our paper "Sink-Aware Pruning for Diffusion Language Models"

Python 12 Updated Feb 26, 2026

A defense framework against MLLM-based web GUI agents. This repository provides both the generative CAPTCHA system and tools for evaluating agent resistance.

Python 19 Updated Mar 28, 2026

[ICLR 2026 🔥] Official pytorch implementation for "Attention Is All You Need for KV Cache in Diffusion LLMs"

Python 37 3 Updated Jan 23, 2026

Hard Labels In! Rethinking the Role of Hard Labels in Mitigating Local Semantic Drift

Python 6 Updated Dec 23, 2025

The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".

941 38 Updated Mar 10, 2026

Official code of the paper "VideoMolmo: Spatio-Temporal Grounding meets Pointing"

Python 54 3 Updated Jul 5, 2025

[ICLR 2026] Optimization-free Dataset Distillation for Object Detection. Paper at: https://arxiv.org/abs/2506.01942

Python 29 1 Updated Jan 26, 2026

(ACL 2025 Main) Distilling RAG for SLMs from LLMs to Transfer Knowledge and Mitigate Hallucination via Evidence and Graph-based Distillation

Python 35 2 Updated Aug 23, 2025

[CVPR 2026 🔥] Time Blindness: Why Video-Language Models Can't See What Humans Can?

Python 62 2 Updated Jan 28, 2026

[NeurIPS 2025] The first web-based benchmark and platform to evaluate visual reasoning and interaction capabilities of MLLM powered agents through diverse and dynamic CAPTCHA puzzles.

JavaScript 61 2 Updated Feb 19, 2026

[NeurIPS25 & ICML25 Workshop on Reliable and Responsible Foundation Models] A Simple Baseline Achieving Over 90% Success Rate Against the Strong Black-box Models of GPT-4.5/4o/o1. Paper at: https:/…

Python 91 6 Updated Feb 3, 2026

[ACL 2025 🔥] A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding

Python 66 4 Updated May 24, 2025

The official implementation of Bi-Mamba

Python 15 Updated Oct 22, 2025

Dataset Distillation via Committee Voting

Shell 14 1 Updated Jul 28, 2025

Mobile-MMLU: A Mobile Intelligence Language Understanding Benchmark. Paper at: https://arxiv.org/abs/2503.20786

Python 11 1 Updated Mar 27, 2025

(CVPR 2025) Official implementation to DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation which outperforms SOTA top 1-acc by +1.3% and increases diversity per class by +5%

Python 27 Updated Aug 23, 2025

Official inference framework for 1-bit LLMs

Python 38,066 3,403 Updated Mar 10, 2026

[ICLR2025] γ -MOD: Mixture-of-Depth Adaptation for Multimodal Large Language Models

Python 43 2 Updated Oct 28, 2025

Elucidated Dataset Condensation (NeurIPS 2024)

Python 20 Updated Oct 5, 2024

Semantics-Aware Patch Encoding and Hierarchical Dependency Modeling for Long-Term Time Series Forecasting

Python 49 6 Updated Aug 7, 2025

FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation

Python 51 1 Updated Aug 24, 2025

Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs

Python 102 8 Updated Oct 23, 2024

Open-LLM-Leaderboard: Open-Style Question Evaluation. Paper at https://arxiv.org/abs/2406.07545

Python 51 7 Updated Jun 27, 2024

The official Meta Llama 3 GitHub site

Python 29,290 3,529 Updated Jan 26, 2025

Prompt Engineering at Your Fingertips!

Python 110 30 Updated Feb 12, 2025

Prompt Builder is a small Python application that implements the principles outlined in the paper "Principled Instructions Are All You Need for Questioning LLaMA-1/2, GPT-3.5/4". It allows users to…

Python 36 11 Updated Apr 12, 2024

A principled instruction benchmark on formulating effective queries and prompts for large language models (LLMs). Our paper: https://arxiv.org/abs/2312.16171

Python 980 105 Updated May 28, 2024
Next