Skip to content
View Xynonners's full-sized avatar

Block or report Xynonners

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Multi-model AI consensus loop via CLI

Python 8 1 Updated Jun 14, 2026
Python 22 2 Updated May 1, 2026

Makes your AI agent think like the laziest senior dev in the room. The best code is the code you never wrote.

JavaScript 8,884 380 Updated Jun 15, 2026

Analyzing available cost token and time for ant vs oai trajectories that are available.

Python 3 Updated Apr 23, 2026
TypeScript 8,584 731 Updated Jun 15, 2026

Open-source observability tool that uses AI agents to self-heal your software

TypeScript 821 51 Updated Jun 15, 2026

Clearing the nanoGPT speedrun's 3.28 val-loss target on one H200 by stacking the Aurora optimizer and Token Superposition Training (TST) on an untouched Transformer.

Python 8 2 Updated Jun 8, 2026

Local Responses-API shim that exposes Factory BYOK models (and optional ChatGPT GPT-5.5 passthrough) to Codex Desktop.

Python 928 88 Updated Jun 13, 2026

+3M Downloads! Repair invalid LLM JSON, commonly used to parse the output of LLMs — Parsing ChatGPT and llm JSON stream response — Partial and incomplete JSON parser python library for OpenAI | rep…

Python 101 7 Updated Feb 19, 2026

Validate, repair, and retry LLM structured outputs. 13 repair strategies for common JSON malformations, JSON Schema validation, and retry-with-feedback prompts.

Python 60 Updated May 14, 2026

Rust-backed repair of malformed JSON for LLM-style outputs

Rust 2 Updated Apr 24, 2026

Permanent memory for AI agents. Single binary, zero dependencies, MCP native.

Rust 432 47 Updated Jun 14, 2026

Compress tool outputs, logs, files, and RAG chunks before they reach the LLM. 60-95% fewer tokens, same answers. Library, proxy, MCP server.

Python 27,785 1,882 Updated Jun 14, 2026

Unlock 2x more Claude Code usage

Rust 185 12 Updated Jun 12, 2026

(HAM) Memory system for AI coding agents. Cut token usage by 80% by scoping context to directories.

JavaScript 61 3 Updated Mar 21, 2026

The Context OS for Autonomous AI Agents. Distill terminal noise into pure semantic signal, stop agent hallucinations, and cut token costs by up to 90%.

Rust 249 25 Updated Jun 15, 2026

Never stop coding. Free AI gateway: one endpoint, 160+ providers (50+ free), connect Claude Code, Codex, Cursor, Cline & Copilot to FREE Claude/GPT/Gemini. RTK+Caveman stacked compression saves 15-…

TypeScript 6,171 1,081 Updated Jun 15, 2026

The batteries-included agent harness.

Python 24,622 3,488 Updated Jun 14, 2026

An ongoing, collaborative meta-analysis about Human-AI-Interactions. We aggregate data and knowledge to build a non-abrasive, user-friendly prompting framework tailored to LLM mechanics, ensuring r…

101 6 Updated Jun 9, 2026

Official implementation of paper "ACON: Optimizing Context Compression for Long-horizon LLM Agents"

Python 87 13 Updated Oct 14, 2025

🎒 Token-Oriented Object Notation (TOON) – Compact, human-readable, schema-aware JSON for LLM prompts. Spec, benchmarks, TypeScript SDK.

TypeScript 24,558 1,090 Updated Jun 12, 2026

Parallax Engine plugin for OpenCode -- friction-loop verification, mode switching (plan/build/debug), multi-perspective reasoning, and the 4 invariants framework

TypeScript 4 Updated Jun 7, 2026

vLLM fork for Tesla V100 (SM70) with AWQ 4-bit support, CUDA 12.8 build flow, and validated Qwen3.5 27B/35B deployment on multi-GPU V100.

Python 417 70 Updated Jun 15, 2026

ADHD — a skill for coding agents. Tree-of-thought with pruning, built on the Claude & Codex Agent SDK. Fans out parallel divergent thoughts under different cognitive frames, scores, prunes traps, d…

TypeScript 813 40 Updated Jun 4, 2026

Fast, lossless LLM inference via dual-view diffusion decoding.

Python 423 17 Updated May 18, 2026

Tasks for planning - enhanced with Hiveminds for multi-model reviews, Swarms for long running autonomous tasks. Nurse to keep things running!

Rust 16 Updated May 27, 2026

Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents

Python 3,279 106 Updated May 11, 2026

[ICLR 2026] ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference

Python 306 29 Updated Jun 8, 2026

DFlash: Block Diffusion for Flash Speculative Decoding

Python 5,108 370 Updated May 10, 2026

Live-SWE-agent: live, runtime self-evolving software engineering agent

406 40 Updated Jan 19, 2026
Next