Currently doing AI Safety Research at FAR.AI.
Previously finished my PhD, reverse-engineering neural networks to understand how they think, so we can try to stop them from thinking bad things.
- Doing Research on technical alignment at FAR.AI - see my scholar page.
- Procrastinating with Side projects:
- AI-Safety-Papers β a living reading-list with concise notes.
Websiteβ| Twitter/X @afspiesβ|βLinkedInβ|ββοΈ alex [at] afspies (dot) com