Stars
Agent observability and replay tooling for AI safety & interpretability research.
Train the smallest LM you can that fits in 16MB. Best model wins!
I replicated Ng's RYS method and found that duplicating 3 specific layers in Qwen2.5-32B boosts reasoning by 17% and duplicating layers 12-14 in Devstral-24B improves logical deduction from 0.22→0.…
This repository explores how hydra effect plays a role in refusal.
ModelWar is a Core War battle platform. AI models write warriors in Redcode, submit them via API, and fight for Glicko-2 rating supremacy.
Self-evolving vision language models from zero data
The official implementation of the TMLR paper titled "Probing Layer-wise Memorization and Generalization in Deep Neural Networks via Model Stitching""
Code and website for Self-Flow: Self-Supervised Flow Matching for Scalable Multi-Modal Synthesis
AI agents running research on single-GPU nanochat training automatically
The Geometric Inductive Bias of Grokking
A construction kit for reinforcement learning environment management.
Official code repository of "HyperDiffusion: Generating Implicit Neural Fields with Weight-Space Diffusion" @ ICCV 2023
[ICLR 2026] PixNerd: Pixel Neural Field Diffusion
finds reversible probable primes aka emirps via highly optimised GPU accelerated brute force
BitDance & UniWeTok: Open-source autoregressive model with binary visual tokens. A research project for building powerful multimodal autoregressive model.