A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights i…

HTML 1,868 112 Updated Jun 16, 2026

jwergieluk / revllm

RevLLM -- Reverse Engineering Tools for Large Language Models

Python 22 3 Updated Feb 29, 2024

iBibek / ascii_art

Forked from sepandhaghighi/art

🎨 ASCII art library for Python

Python 1 Updated Feb 9, 2024

iBibek / llm-latent-language

Forked from epfl-dlab/llm-latent-language

Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".

Jupyter Notebook 1 Updated Mar 8, 2024

iBibek / attention-visualization

Forked from mattneary/attention

visualizing attention for LLM users

Python 1 Updated May 24, 2023

iBibek / annotated_diffusion_pytorch

Forked from huggingface/blog

Public repo for HF blog posts

Jupyter Notebook 1 Updated Jun 9, 2022

iBibek / alpaca-lora

Forked from tloen/alpaca-lora

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 1 1 Updated May 22, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bibek Upadhayay iBibek

Achievements

Achievements

Organizations

Block or report iBibek

Stars

UNHSAILLab / sandwich-attack

Mihaiii / llm_steer

decoderesearch / SAELens

facebookresearch / large_concept_model

wegodev2 / virtual-prompt-injection

UNHSAILLab / working-memory-attack-on-llms

av / harbor

au-revoir / model-editing-ft

poloclub / transformer-explainer

aengusl / latent-adversarial-training

voidism / Lookback-Lens

ydyjya / Awesome-LLM-Safety

jwergieluk / revllm

iBibek / ascii_art

iBibek / llm-latent-language

iBibek / attention-visualization

iBibek / annotated_diffusion_pytorch

iBibek / alpaca-lora

iBibek / Convert-News-Feed-to-Audio-Files-using-GTTS-in-Python

epfl-dlab / llm-latent-language

nrimsky / LM-exp

allenai / science-parse

iBibek / MalConv-Deep-learning-for-PE-malware-classification

hkproj / pytorch-llama

UNHSAILLab / TaCo

m-popovic / chrF

turboderp-org / exllamav2

Teddy-Li / LLM-NLI-Analysis

karpathy / llama2.c

UNHSAILLab / Nepali-Alpaca-ChatGPT