Skip to content
View kyr0's full-sized avatar
hyperfocus
hyperfocus

Organizations

@fuse-box @springtype-org @machbarschaft @Colivery @Fluctura @PCemOnMac @allchords @kim-bayern-dev

Block or report kyr0

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

nyanos-v1

C 3 Updated Jul 3, 2026

TiRex-2: A Zero-Shot Timeseries Forecasting Model

Python 33 4 Updated Jul 2, 2026

Grafting script and vLLM container inference runtime Makefile for kyr0/Ornith-35B-FP8-E4M3-MTP

Shell 2 Updated Jul 2, 2026

JetSpec: Breaking the Scaling Ceiling of Speculative Decoding with Causal Parallel Tree Drafting

Python 145 6 Updated Jun 27, 2026
Python 7 2 Updated Jun 27, 2026

Fully uncensored, capability-enhanced abliteration of Qwen3.6-27B. NVFP4 + z-lab DFlash speculative decoding (n=12) on the unified ghcr.io/aeon-7/aeon-vllm-ultimate:latest container, tuned for long…

Python 396 39 Updated Jun 28, 2026

First foundation ASR built for the real world - 7 atomic acoustic conditions, 54 compound scenarios, 2.6M samples, and up to ~30% gains over SOTA where every other model falls apart. **You'll come …

Python 1,055 69 Updated Jun 2, 2026

Building Foundation Models for Human Behavior Simulation

Python 103 13 Updated Jul 2, 2026

The easiest and fastest way to create production-ready Kubernetes clusters on Hetzner Cloud

Crystal 3,614 226 Updated Jun 24, 2026
20 Updated May 31, 2026

Give a query, get a dataroom. Pi + self-hosted Qwen3.6 research harness on a single L4.

Python 172 17 Updated Jun 20, 2026

Build Real-Time Knowledge Graphs for AI Agents

Python 28,308 2,836 Updated Jul 2, 2026

We propose Bidirectional Evolutionary Search (BES), a search framework that couples forward candidate evolution with backward goal decomposition.

Python 160 15 Updated May 28, 2026

Build product integrations with AI.

TypeScript 10,933 1,176 Updated Jul 2, 2026

An open source, self-hosted implementation of the Tailscale control server

Go 40,820 2,250 Updated Jul 1, 2026
Jupyter Notebook 214 21 Updated May 20, 2026

SkillOpt with local AI is a text-space optimizer that trains reusable natural-language skills for frozen LLM agents through trajectory-driven edits, validation-gated updates, and deployable best_sk…

Python 79 19 Updated May 25, 2026

A Datacenter Scale Distributed Inference Serving Framework

Rust 7,407 1,300 Updated Jul 3, 2026

The AI search platform

Java 6,985 723 Updated Jul 3, 2026

Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.

Swift 12,862 1,312 Updated Jun 30, 2026

Run 70B+ LLMs on Apple Silicon by using SSD as extended memory — intelligent layer streaming and caching for Mac

Rust 22 3 Updated Mar 14, 2026

Fast and Accurate Code Search for Agents. Uses ~98% fewer tokens than grep+read

Python 5,477 233 Updated Jul 2, 2026

reverse engineering Gemini's SynthID detection

Python 4,509 486 Updated Apr 29, 2026
TypeScript 4 1 Updated Apr 15, 2026

Efficient Universal Perception Encoder: a single on-device vision encoder with versatile representations that match or exceed specialized experts across multiple task domains.

Python 678 40 Updated Apr 14, 2026

The best-benchmarked open-source AI memory system. And it's free.

Python 56,900 7,350 Updated Jul 3, 2026

HeadAudio: An audio node/processor for real-time audio-driven viseme detection and lip-sync in browsers.

JavaScript 33 8 Updated Dec 10, 2025

Talking Head (3D): A JavaScript class for real-time lip-sync using full-body 3D avatars.

JavaScript 1,375 317 Updated Jun 2, 2026

MLX-Embeddings is the best package for running Vision and Language Embedding models locally on your Mac using MLX.

Python 411 54 Updated May 13, 2026

LLM inference server with continuous batching & SSD caching for Apple Silicon — managed from the macOS menu bar

Python 17,426 1,474 Updated Jul 3, 2026
Next