Awesome-Jailbreak-on-LLMs is a collection of state-of-the-art, novel, and exciting jailbreak methods on LLMs. It contains papers, code, datasets, evaluations, and analyses.

1,197 101 Updated Feb 6, 2026

AIM-Intelligence / Automated-Multi-Turn-Jailbreaks

Python 114 6 Updated Dec 3, 2025

verazuo / jailbreak_llms

[CCS'24] A dataset consisting of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).

Jupyter Notebook 3,550 317 Updated Dec 24, 2024

mlcommons / ailuminate

The AILuminate v1.1 benchmark suite is an AI risk assessment benchmark developed with broad involvement from leading AI companies, academia, and civil society.

69 14 Updated Jun 11, 2025

simplescaling / s1

s1: Simple test-time scaling

Python 6,634 766 Updated Jun 25, 2025

smilegate-ai / korean_unsmile_dataset

440 33 Updated Apr 8, 2022

allenai / safety-eval

A simple evaluation of generative language models and safety classifiers.

Python 85 22 Updated Dec 11, 2025

EthicalML / awesome-production-machine-learning

A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning

20,102 2,511 Updated Feb 1, 2026

jihoo-kim / awesome-production-llm

A curated list of awesome open-source libraries for production LLM

508 56 Updated Dec 31, 2024

friedrichor / Awesome-Multimodal-Papers

A curated list of awesome Multimodal studies.

312 23 Updated Dec 14, 2025

meta-llama / PurpleLlama

Set of tools to assess and improve LLM security.

Python 4,010 695 Updated Feb 5, 2026

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

17,316 1,111 Updated Jan 27, 2026

ydyjya / Awesome-LLM-Safety

A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights i…

HTML 1,768 91 Updated Feb 1, 2026

ddehun / k-viscuit

Official repository for "Evaluating Visual and Cultural Interpretation: The K-Viscuit Benchmark with Human-VLM Collaboration, arXiv 2024.06" (https://arxiv.org/pdf/2406.16469)

Python 7 2 Updated Aug 17, 2024

Instruction-Tuning-with-GPT-4 / GPT-4-LLM

Instruction Tuning with GPT-4

HTML 4,340 308 Updated Jun 11, 2023

pliang279 / HEMM

Holistic evaluation of multimodal foundation models

Python 49 1 Updated Aug 11, 2024

EvolvingLMMs-Lab / lmms-eval

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,625 512 Updated Feb 4, 2026

chrisliu298 / awesome-llm-unlearning

A resource repository for machine unlearning in large language models

533 30 Updated Jan 6, 2026

xai-org / grok-1

Grok open release

Python 51,464 8,495 Updated Aug 30, 2024

karpathy / minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 10,296 1,003 Updated Jul 1, 2024

Minseok Choi brightjade

Stars

godotengine / godot-demo-projects

godotengine / awesome-godot

xinyi-hou / LLM4SE_SLR

lmgame-org / GamingAgent

Farama-Foundation / Gymnasium

LLM-Testing / LLM4SoftwareTesting

ZJU-ACES-ISE / ChatUniTest

ligurio / awesome-ttygames

laude-institute / harbor

YerbaPage / Awesome-Repo-Level-Code-Generation

yueliu1999 / Awesome-Jailbreak-on-LLMs