Stars
Multi functional app to find duplicates, empty folders, similar images etc.
Header only library for writing build recipes in C.
Trail of Bits Claude Code skills for security research, vulnerability detection, and audit workflows
Measuring frontier coding agents on original, long-horizon engineering tasks
SlopCodeBench: Measuring Code Erosion Under Iterative Specification Refinement
Applies diffs based on context, not line numbers. Useful for AI-generated code.
Free universal database tool and SQL client
A high performance implementation of HDBSCAN clustering.
Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning
An open source, self-hosted implementation of the Tailscale control server
Cntlm is an NTLM / NTLM Session Response / NTLMv2 authenticating HTTP proxy intended to help you break free from the chains of Microsoft proprietary world. More info on http://cntlm.sourceforge.net…
Reinforcement Learning via Self-Distillation (SDPO)
Cumulative Agentic Skill Creation through Autonomous Development and Evolution
A minimal implementation of DeepMind's Genie world model
Beads - A memory upgrade for your coding agent
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image Generation.
[ACL25] FEA-Bench: A Benchmark for Evaluating Repository-Level Code Generation for Feature Implementation
Beyond Sliding Windows: Learning to Manage Memory in Non-Markovian Environments
Roo Code gives you a whole dev team of AI agents in your code editor.
Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning
An Emacs framework for the stubborn martian hacker
WildEval / ZeroEval
Forked from allenai/WildBenchA simple unified framework for evaluating LLMs
Helpful kernel tutorials, examples and SKILLs for tile-based GPU programming