Skip to content
View altock's full-sized avatar
🤖
🤖

Block or report altock

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 18,754 2,992 Updated Apr 14, 2026
Python 1 Updated Feb 2, 2026

AIE Code Agents Hackathon NYC 2025

Python 4 1 Updated Jan 10, 2026

Browser extension that blocks algorithmic 'home' feeds, while preserving unique pages/links, DMs, search, and subscriptions

HTML 2 Updated Jan 22, 2026

Sutskever 30 implementations inspired by https://papercode.vercel.app/ | For Agents, use https://github.com/pageman/Sutskever-Agent | Polyglot / Multi-Backed version at https://github.com/pageman/s…

Jupyter Notebook 3,279 446 Updated Mar 15, 2026

Claude Code skill that generates UIs in the National Design Studio style used by realfood.gov, trumprx.gov, and other "America by Design" government sites.

16 Updated Jan 11, 2026

Claude UX skill plugin for web app usability audits, accessibility checks, and design specs

6 Updated Dec 19, 2025

A super fast Graph Database uses GraphBLAS under the hood for its sparse adjacency matrix graph representation. Our goal is to provide the best Knowledge Graph for LLM (GraphRAG).

C 4,652 394 Updated Jun 23, 2026

This is the template I use to start new full-stack projects.

TypeScript 1,935 398 Updated Jun 16, 2025

PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning

Python 635 89 Updated Apr 21, 2026

MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user develop their prompts into full models.

Python 184 12 Updated Nov 6, 2025

AI chat assistant for Obsidian with contextual awareness, smart writing assistance, and one-click edits. Features vault-aware conversations, semantic search, and local model support.

TypeScript 2,296 187 Updated Feb 16, 2026

Fast State-of-the-Art Static Embeddings

Python 2,132 122 Updated Jun 6, 2026

LLM code

TypeScript 789 161 Updated May 5, 2025

The AI to keep you focused 😈

Python 412 40 Updated Feb 20, 2025

Inspect: A framework for large language model evaluations

Python 2,240 572 Updated Jun 23, 2026

LLM101n: Let's build a Storyteller

37,369 2,053 Updated Aug 1, 2024

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Python 142,758 20,541 Updated Jun 22, 2026

PyTorch native post-training library

Python 5,776 730 Updated Jun 23, 2026

DSPy: The framework for programming—not prompting—language models

Python 35,328 2,998 Updated Jun 18, 2026

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python 19,604 2,143 Updated Jun 22, 2026
Jupyter Notebook 9,661 678 Updated Oct 16, 2025

A modern cookiecutter template for Python projects that use Poetry for dependency management

Python 427 63 Updated Oct 2, 2024

Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training".

149 27 Updated Mar 9, 2024

Code associated with the paper, Soft Prompts For Evaluation.

Python 6 Updated Feb 1, 2024

list of projects related to EA Software Engineers

28 4 Updated Jun 23, 2023
Python 1 Updated Feb 8, 2024

Machine Learning for Alignment Bootcamp

Jupyter Notebook 83 42 Updated Apr 27, 2022
Next