Skip to content
View altock's full-sized avatar
🤖
🤖

Block or report altock

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 18,004 2,911 Updated Nov 3, 2025
Python 1 Updated Feb 2, 2026

AIE Code Agents Hackathon NYC 2025

Python 3 1 Updated Jan 10, 2026

Browser extension that blocks algorithmic 'home' feeds, while preserving unique pages/links, DMs, search, and subscriptions

HTML 2 Updated Jan 22, 2026

Sutskever 30 implementations inspired by https://papercode.vercel.app/

Jupyter Notebook 3,194 435 Updated Feb 24, 2026
12 Updated Jan 11, 2026

Claude UX skill plugin for web app usability audits, accessibility checks, and design specs

6 Updated Dec 19, 2025

A super fast Graph Database uses GraphBLAS under the hood for its sparse adjacency matrix graph representation. Our goal is to provide the best Knowledge Graph for LLM (GraphRAG).

C 3,705 294 Updated Mar 12, 2026

This is the template I use to start new full-stack projects.

TypeScript 1,936 409 Updated Jun 16, 2025

PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning

Python 613 85 Updated Mar 1, 2026

MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user develop their prompts into full models.

Python 184 12 Updated Nov 6, 2025

AI chat assistant for Obsidian with contextual awareness, smart writing assistance, and one-click edits. Features vault-aware conversations, semantic search, and local model support.

TypeScript 2,154 168 Updated Feb 16, 2026

Fast State-of-the-Art Static Embeddings

Python 2,008 116 Updated Mar 12, 2026

LLM code

TypeScript 788 166 Updated May 5, 2025

The AI to keep you focused 😈

Python 415 42 Updated Feb 20, 2025

Inspect: A framework for large language model evaluations

Python 1,821 423 Updated Mar 12, 2026

LLM101n: Let's build a Storyteller

36,472 1,989 Updated Aug 1, 2024

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Python 126,892 17,945 Updated Mar 12, 2026

PyTorch native post-training library

Python 5,697 705 Updated Mar 12, 2026

DSPy: The framework for programming—not prompting—language models

Python 32,754 2,683 Updated Mar 12, 2026

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python 18,722 2,025 Updated Mar 9, 2026
Jupyter Notebook 9,665 678 Updated Oct 16, 2025

A modern cookiecutter template for Python projects that use Poetry for dependency management

Python 424 64 Updated Oct 2, 2024

Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training".

137 19 Updated Mar 9, 2024

Code associated with the paper, Soft Prompts For Evaluation.

Python 6 Updated Feb 1, 2024

list of projects related to EA Software Engineers

28 4 Updated Jun 23, 2023
Python 1 Updated Feb 8, 2024

Machine Learning for Alignment Bootcamp

Jupyter Notebook 82 42 Updated Apr 27, 2022
Next