Skip to content
View tmaiaroto's full-sized avatar

Highlights

  • Pro

Organizations

@SocialHarvest

Block or report tmaiaroto

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Test LLMs on real tasks. Compare models side-by-side.

TypeScript 254 23 Updated May 18, 2026

Control panel for VLLM, Sglang, llama.cpp, exllamav3

TypeScript 971 76 Updated May 18, 2026

Orca is the next-gen IDE for working with a fleet of parallel agents. Run any coding agent with your own subscription. Available on desktop and mobile

TypeScript 2,678 182 Updated May 18, 2026

Desktop Companion for Hermes Agent

TypeScript 5,633 667 Updated May 17, 2026

Atomic-Chat is an open source alternative to ChatGPT that runs 100% offline on your computer.

TypeScript 662 60 Updated May 15, 2026

llama.cpp fork with TurboQuant WHT-rotated KV cache & weight compression + Gemma 4 MTP and Qwen 3.6 NextN speculative decoding (+30-50% throughput).

C++ 189 22 Updated May 14, 2026

Browser Harness | Self-healing harness that enables LLMs to complete any task.

Python 13,072 1,197 Updated May 15, 2026

Mutation testing for Go source code. Fork from https://github.com/zimmski/go-mutesting

Go 235 27 Updated Jan 12, 2026

llama.cpp fork with TQ3_1S/4S CUDA kernels β€” 3.5-bit WHT quantization achieving Q4s quality at 10% smaller size. Based on RaBitQ-inspired Walsh-Hadamard transform. Enables 27B models on 16GB GPUs w…

C++ 183 8 Updated May 17, 2026

llama.cpp fork with additional SOTA quants and improved performance

C++ 2,479 317 Updated May 18, 2026

A knowledge graph for the notes you already have. Plain Markdown, git-native, fully local. Wire anything to anything with semantic predicates β€” supports, depends on, contradicts, relates to goal β€” …

Python 4 Updated May 13, 2026

The open-source security layer for AI agents. Deterministic guardrails, PII redaction, and EU AI Act compliance in one line of code.

TypeScript 20 Updated May 12, 2026

This solution provides an automated, serverless way to redact sensitive data from PDF files using Google Cloud Services like Data Loss Prevention (DLP), Cloud Workflows, and Cloud Run.

HCL 65 27 Updated Feb 20, 2026

Community recipes for serving LLMs on RTX 3090. Multi-engine (vLLM, llama.cpp, SGLang) and model-agnostic. Currently shipping Qwen3.6-27B configs for 1Γ— and 2Γ— cards.

Python 982 49 Updated May 18, 2026

Your First LLM-Wiki Conversation Knowledge Base

Python 331 40 Updated May 18, 2026

Memory kernel stack for Hermes agent

Python 28 2 Updated May 16, 2026

King of Spades: a cinematic playing-card dashboard theme for Hermes Agent.

Shell 4 Updated Apr 26, 2026

A modern platform for visual, flexible, and extensible graph-based investigations. For cybersecurity analysts and investigators.

TypeScript 3,433 459 Updated May 15, 2026

Terminal AI that reads code as code. AST surgery, LSP operations, a live dependency graph β€” not grep-and-paste.

TypeScript 688 45 Updated May 17, 2026

Custom skins (visual themes) for the Hermes CLI agent

Python 365 21 Updated May 7, 2026

Penpot: The open-source design tool for design and code collaboration

Clojure 47,827 2,999 Updated May 17, 2026

A standalone BMAD module that transforms code repositories, documentation websites, and developer discourse into agentskills.io-compliant, version-pinned, provenance-backed agent skills.

Python 70 6 Updated May 16, 2026

llama.cpp fork with TurboQuant quantization (turbo2/3/4) and TriAttention GPU-accelerated KV cache pruning. 75 tok/s on Qwen3-8B / RTX 3080.

C++ 26 6 Updated Apr 9, 2026

Hermes WebUI: The best way to use Hermes Agent from the web or from your phone!

Python 7,622 1,029 Updated May 18, 2026

The headless browser for AI agents and web scraping

Rust 13,125 839 Updated May 16, 2026

Lucebox: LLM inference server built for speed for specific consumer hardware.

C++ 2,139 200 Updated May 17, 2026

OWASP Autonomous Penetration Testing Standard

Python 655 84 Updated May 15, 2026

Supercharge Your LLM Application Evaluations πŸš€

Python 13,940 1,423 Updated Feb 24, 2026
Next