Skip to content
View chanind's full-sized avatar

Highlights

  • Pro

Organizations

@steering-vectors

Block or report chanind

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 7 1 Updated May 20, 2025

A JAX research toolkit for building, editing, and visualizing neural networks.

Python 1,802 66 Updated Jun 22, 2025

Chinese character stroke order animations and practice quizzes

TypeScript 4,038 617 Updated Nov 13, 2024

Code for reproducing our paper "Are Sparse Autoencoders Useful? A Case Study in Sparse Probing"

Jupyter Notebook 26 3 Updated Mar 31, 2025
Jupyter Notebook 32 12 Updated Apr 30, 2024
Python 107 30 Updated Jul 15, 2025
Python 335 31 Updated Jun 25, 2025

A playbook for systematically maximizing the performance of deep learning models.

28,969 2,380 Updated Jun 18, 2024
Python 35 9 Updated Jan 17, 2025

DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphic…

Python 7,609 976 Updated Jul 21, 2025

Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors"

Python 14 1 Updated Dec 14, 2024

A visual playground for agentic workflows: Iterate over your agents 10x faster

TypeScript 5,309 381 Updated Jul 20, 2025
Python 7 Updated Oct 29, 2024
Jupyter Notebook 182 38 Updated Jul 14, 2025
Jupyter Notebook 2 Updated Mar 25, 2024

Understanding Why and How Instruction Tuning Changes Pre-trained Models

Python 23 3 Updated Mar 18, 2024
Python 32 3 Updated Mar 4, 2024

Generate single text file that represents a python repository for LLMs

Python 2 Updated Jun 9, 2024

A Python library for doing curve matching with Fréchet distance and Procrustes analysis

Python 2 Updated Mar 27, 2024

Sparse Autoencoder for Mechanistic Interpretability

Python 257 44 Updated Jul 20, 2024

Training Sparse Autoencoders on Language Models

Python 884 179 Updated Jul 23, 2025
Jupyter Notebook 102 11 Updated Feb 11, 2025

Steering vectors for transformer language models in Pytorch / Huggingface

Python 117 13 Updated Feb 21, 2025

Steering Llama 2 with Contrastive Activation Addition

Jupyter Notebook 165 47 Updated May 23, 2024
Next