Stars
PyTorch implementation of JiT https://arxiv.org/abs/2511.13720
A library for mechanistic interpretability of GPT-style language models
Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"
🤖 Intelligent integration between Claude Code and Google Gemini for large-scale code analysis
Official code implementation of SKU, Accepted by ACL 2024 Findings
Awesome Large Reasoning Model(LRM) Safety.This repository is used to collect security-related research on large reasoning models such as DeepSeek-R1 and OpenAI o1, which are currently very popular.
Provide .bst files for NeurIPS latex template
Awesome Jailbreak, red teaming arxiv papers (Automatically Update Every 12th hours)
A new algorithm that formulates jailbreaking as a reasoning problem.
Code release for Best-of-N Jailbreaking
A beautiful, simple, clean, and responsive Jekyll theme for academics
Code for the paper "DiffusionNER: Boundary Diffusion for Named Entity Recognition", accepted at ACL 2023.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
generate a PDF file with input as XML and XSLT files
A Python library to access Instagram's private API.
Siamese and triplet networks with online pair/triplet mining in PyTorch