[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning
[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation
✨✨Latest Advances on Multimodal Large Language Models
An up-to-date curated list of state-of-the-art research, papers, and resources on hallucinations in large vision-language models
📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).
A toolbox for benchmarking trustworthiness of multimodal large language models (MultiTrust, NeurIPS 2024 Track Datasets and Benchmarks)
A curated list of resources dedicated to the safety of Large Vision-Language Models. This repository aligns with our survey titled A Survey of Safety on Large Vision-Language Models: Attacks, Defen…
A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
🚀 LeetCode From Zero To One: curated problem lists, solution write-ups, algorithm templates, and a practice roadmap; continuously updated...
Code for paper "Membership Inference Attacks Against Vision-Language Models"
AnyDoor: Test-Time Backdoor Attacks on Multimodal Large Language Models
[NeurIPS'25] VLMs Can Aggregate Scattered Training Patches
Code for paper "The Philosopher’s Stone: Trojaning Plugins of Large Language Models"
[NeurIPS'22] EHRSQL: A Practical Text-to-SQL Benchmark for Electronic Health Records
This repository provides a benchmark for prompt injection attacks and defenses in LLMs
[USENIX Security'25] THEMIS: Towards Practical Intellectual Property Protection for Post-Deployment On-Device Deep Learning Models
This GitHub repository summarizes research papers on AI security from the four top academic conferences.
Code and data for the paper: On the Resilience of LLM-Based Multi-Agent Collaboration with Faulty Agents
S-Eval: Towards Automated and Comprehensive Safety Evaluation for Large Language Models
Code and data for "ImgTrojan: Jailbreaking Vision-Language Models with ONE Image"
[CVPR 2023] "TrojViT: Trojan Insertion in Vision Transformers" by Mengxin Zheng, Qian Lou, Lei Jiang
Official implementation repository for the paper Towards General Conceptual Model Editing via Adversarial Representation Engineering.
LAVIS - A One-stop Library for Language-Vision Intelligence
Official codebase for Image Hijacks: Adversarial Images can Control Generative Models at Runtime
An open-source implementation for fine-tuning the Qwen-VL series by Alibaba Cloud.