Popular repositories Loading
-
-
ecco
ecco PublicForked from jalammar/ecco
Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, B…
Jupyter Notebook
-
AutoDAN
AutoDAN PublicForked from SheltonLiu-N/AutoDAN
[ICLR 2024] The official implementation of our ICLR2024 paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models".
Python
-
refusal_direction
refusal_direction PublicForked from andyrdt/refusal_direction
Code and results accompanying the paper "Refusal in Language Models Is Mediated by a Single Direction".
Python
If the problem persists, check the GitHub status page or contact support.