17 stars written in Python

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 66,164 6,649 Updated Jan 22, 2026

s1: Simple test-time scaling

Python 6,650 765 Updated Jun 25, 2025

Simple RL training for reasoning

Python 3,843 289 Updated Dec 23, 2025

A Python library to access Instagram's private API.

Python 3,244 641 Updated May 6, 2024

A library for mechanistic interpretability of GPT-style language models

Python 3,236 536 Updated Mar 27, 2026

Siamese and triplet networks with online pair/triplet mining in PyTorch

Python 3,169 633 Updated Apr 29, 2023

PyTorch implementation of JiT https://arxiv.org/abs/2511.13720

Python 2,213 152 Updated Dec 8, 2025

Code release for Best-of-N Jailbreaking

Python 562 96 Updated Feb 5, 2025

Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"

Python 147 12 Updated Oct 13, 2025

Code for the paper "DiffusionNER: Boundary Diffusion for Named Entity Recognition", accepted at ACL 2023.

Python 114 20 Updated Aug 29, 2023

Awesome Jailbreak, red teaming arXiv papers (automatically updated every 12 hours)

Python 102 12 Updated Mar 27, 2026

Awesome Large Reasoning Model (LRM) Safety. This repository collects security-related research on large reasoning models such as DeepSeek-R1 and OpenAI o1, which are currently very popular.

Python 82 6 Updated Mar 27, 2026

A new algorithm that formulates jailbreaking as a reasoning problem.

Python 26 4 Updated Jul 2, 2025

Official code implementation of SKU, accepted to ACL 2024 Findings

Python 20 1 Updated Dec 18, 2024

Python 10 2 Updated Oct 19, 2024