smcaleese

Stephen McAleese smcaleese

14 followers · 56 following

Dublin, Ireland
stephenmcaleese.com

Stars

UKGovernmentBEIS / inspect_ai

Inspect: A framework for large language model evaluations

Python 1,608 363 Updated Dec 23, 2025

devindkim / devGPT

A simple PyTorch implementation of GPT-2, optimized to run on MacBook Pro M1/M2.

Python 1 Updated Mar 19, 2024

OpenLMLab / MOSS-RLHF

Secrets of RLHF in Large Language Models Part I: PPO

Python 1,408 105 Updated Mar 3, 2024

janweh / Counterfactual-Trajectory-Explanations-for-Learned-Reward-Functions

Code for my Master's thesis: I generate Counterfactual Trajectory Explanations about Reward Functions that were learned with Inverse Reinforcement Learning

Jupyter Notebook 1 1 Updated Oct 30, 2023

callummcdougall / ARENA_3.0

Jupyter Notebook 852 537 Updated Nov 12, 2025

jprivera44 / LLM_Sycophancy

This project focuses on the work of understanding sycophantic behavior within LLMs.

Jupyter Notebook 1 Updated Feb 15, 2024

callummcdougall / ARENA_2.0

Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.

HTML 236 90 Updated Aug 11, 2025

nrimsky / CAA

Steering Llama 2 with Contrastive Activation Addition

Jupyter Notebook 201 59 Updated May 23, 2024

nrimsky / LM-exp

LLM experiments done during SERI MATS - focusing on activation steering / interpreting activation spaces

Jupyter Notebook 100 28 Updated Sep 21, 2023

PauseAI / pauseai-website

Website for PauseAI.info

Svelte 23 63 Updated Dec 22, 2025

sophiamyang / tutorials-LangChain

Jupyter Notebook 423 184 Updated Oct 19, 2023

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 51,343 8,600 Updated Nov 12, 2025

mrahtz / learning-from-human-preferences

Reproduction of OpenAI and DeepMind's "Deep Reinforcement Learning from Human Preferences"

Python 333 69 Updated Nov 29, 2021

riceissa / aiwatch

Website to track people, organizations, and products (tools, websites, etc.) in AI safety

HTML 23 11 Updated Dec 23, 2025

jqhoogland / rationalia-starter

An Obsidian starter kit for LessWrong, Effective Altruism, AI Alignment, etc.

JavaScript 14 2 Updated Nov 12, 2022

jonathanpaulson / grabby_aliens

C++ 88 12 Updated Apr 6, 2022

vercel / next.js

The React Framework

JavaScript 136,743 30,114 Updated Dec 23, 2025

codecrafters-io / build-your-own-x

Master programming by recreating your favorite technologies from scratch.

Markdown 451,503 42,349 Updated Oct 10, 2025

apptension / developer-handbook

An opinionated guide on how to become a professional Web/Mobile App Developer.

5,963 641 Updated Feb 16, 2024

noahshinn / reflexion

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Python 3,001 290 Updated Jan 14, 2025

leetcode-mafia / cheetah

Mac app for crushing tech interviews with AI

Swift 4,255 304 Updated Jan 14, 2025

hamishhuggard / AI-alignment-map

A map of the AI alignment landscape

JavaScript 10 2 Updated Aug 23, 2025

nickypro / gatsby-personal-website

Personal Website built using GatsbyJS and Strapi

JavaScript 1 Updated Nov 18, 2020

nickypro / separability

Tools for working with Language Models

Python 9 2 Updated Jan 24, 2024

epoch-research / data-stock

Models for data stocks and training dataset sizes

Python 18 4 Updated Jul 10, 2024

nottombrown / rl-teacher

Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for collecting human feedback

Python 562 96 Updated Jan 24, 2023

trishume / trishume.github.com

My website/blog thing. Made with Jekyll.

HTML 9 9 Updated Apr 17, 2025

MaksimIM / JaynesProbabilityTheory

Exercise solutions and explanations for the book Probability Theory: The Logic of Science by E.T. Jaynes. Created by the reading group at r/jaynesprobability

HTML 63 4 Updated Oct 25, 2021

socketteer / loom

Multiversal tree writing interface for human-AI collaboration

Python 1,322 88 Updated Jun 28, 2024

Agents-of-Change / adventofai

Advent of Code AI (unfinished)

HTML 1 Updated Dec 28, 2022
