Skip to content
View smcaleese's full-sized avatar

Block or report smcaleese

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Inspect: A framework for large language model evaluations

Python 1,608 363 Updated Dec 23, 2025

A simple pytorch implementation of GPT-2, optimized to run on Macbook Pro M1/M2.

Python 1 Updated Mar 19, 2024

Secrets of RLHF in Large Language Models Part I: PPO

Python 1,408 105 Updated Mar 3, 2024

Code for my Master Thesis: I generate Counterfactual Trajectory Explanations about Reward Functions that were learned with Inverse Reinforcemnet Learning

Jupyter Notebook 1 1 Updated Oct 30, 2023
Jupyter Notebook 852 537 Updated Nov 12, 2025

This project focuses on the work of understanding sycophantic behavior within LLMs.

Jupyter Notebook 1 Updated Feb 15, 2024

Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.

HTML 236 90 Updated Aug 11, 2025

Steering Llama 2 with Contrastive Activation Addition

Jupyter Notebook 201 59 Updated May 23, 2024

LLM experiments done during SERI MATS - focusing on activation steering / interpreting activation spaces

Jupyter Notebook 100 28 Updated Sep 21, 2023

Website for PauseAI.info

Svelte 23 63 Updated Dec 22, 2025
Jupyter Notebook 423 184 Updated Oct 19, 2023

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 51,343 8,600 Updated Nov 12, 2025

Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"

Python 333 69 Updated Nov 29, 2021

Website to track people, organizations, and products (tools, websites, etc.) in AI safety

HTML 23 11 Updated Dec 23, 2025

An Obsidian starter kit for LessWrong, Effective Altruism, AI Alignment, etc.

JavaScript 14 2 Updated Nov 12, 2022

The React Framework

JavaScript 136,743 30,114 Updated Dec 23, 2025

Master programming by recreating your favorite technologies from scratch.

Markdown 451,503 42,349 Updated Oct 10, 2025

An opinionated guide on how to become a professional Web/Mobile App Developer.

5,963 641 Updated Feb 16, 2024

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Python 3,001 290 Updated Jan 14, 2025

Mac app for crushing tech interviews with AI

Swift 4,255 304 Updated Jan 14, 2025

A map of the AI alignment landscape

JavaScript 10 2 Updated Aug 23, 2025

Personal Website built using GatsbyJS and Strapi

JavaScript 1 Updated Nov 18, 2020

Tools for working with Language Models

Python 9 2 Updated Jan 24, 2024

Models for data stocks and training dataset sizes

Python 18 4 Updated Jul 10, 2024

Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for collecting human feedback

Python 562 96 Updated Jan 24, 2023

My website/blog thing. Made with Jekyll.

HTML 9 9 Updated Apr 17, 2025

Exercise solutions and explanations for the book Probability Theory: The Logic of Science by E.T. Jaynes. Created by the reading group at r/jaynesprobability

HTML 63 4 Updated Oct 25, 2021

Multiversal tree writing interface for human-AI collaboration

Python 1,322 88 Updated Jun 28, 2024

Advent of Code AI (unfinished)

HTML 1 Updated Dec 28, 2022
Next