Skip to content
View b5y's full-sized avatar
🇫🇮
Working from home
🇫🇮
Working from home

Organizations

@elfi-dev

Block or report b5y

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

The end of web parsing. The beginning of scalable pixel-native search.

Python 2,355 203 Updated Jun 20, 2026

Self-hosted AI workspace.

Python 75,671 9,829 Updated Jun 19, 2026

Perplexity open source garden for inference technology

Rust 584 56 Updated May 27, 2026

DiffusionBlocks: Block-wise Neural Network Training via Diffusion Interpretation

Python 229 23 Updated Feb 18, 2026

OSCAR: Offline Spectral Covariance-Aware Rotation for 2-bit KV Cache Quantization

Python 534 74 Updated Jun 17, 2026

A simple SWE style browser agent framework that achieves SOTA results on long horizon web tasks.

Python 5,533 348 Updated Jun 3, 2026

1 place to call all your agents - OpenCode, Hermes, Claude Managed Agents, Cursor Agents API, DeepAgents.

Rust 974 103 Updated Jun 20, 2026

Fastino's LLM guardrail

41 5 Updated May 12, 2026

Cuda kernels for leveraging LLM sparsity to improve throughput and decrease the memory requirements during inference and training.

Cuda 245 23 Updated May 14, 2026

Incremental engine for long horizon agents 🌟 Star if you like it!

Rust 10,444 812 Updated Jun 21, 2026

cuda-oxide is an experimental Rust-to-CUDA compiler that lets you write (SIMT) GPU kernels in safe(ish), idiomatic Rust. It compiles standard Rust code directly to PTX — no DSLs, no foreign languag…

Rust 2,800 190 Updated Jun 20, 2026

TokenSpeed is a speed-of-light LLM inference engine.

Python 1,469 163 Updated Jun 21, 2026

Polymarket Data Retriever that fetches, processes, and structures Polymarket data including markets, order events and trades.

Python 2,172 420 Updated May 27, 2026

talkie is a vintage language model from 1930

Python 913 56 Updated May 19, 2026

1K resolution vision transformers pretrained on 1B human images.

Python 810 52 Updated May 24, 2026

Bring agents to any interfaces

TypeScript 820 104 Updated Jun 21, 2026

TriAttention — Efficient long reasoning with trigonometric KV cache compression. Enables OpenClaw local deployment on memory-constrained GPUs.

Python 791 77 Updated Jun 18, 2026

NVIDIA AITune is an inference toolkit designed for tuning and deploying Deep Learning models with a focus on NVIDIA GPUs.

Python 275 31 Updated Jun 3, 2026

Dimensional is the agentic operating system for physical space. Vibecode humanoids, quadrupeds, drones, and other hardware platforms in natural language and build multi-agent systems that work seam…

Python 3,536 705 Updated Jun 21, 2026

Efficient Universal Perception Encoder: a single on-device vision encoder with versatile representations that match or exceed specialized experts across multiple task domains.

Python 666 38 Updated Apr 14, 2026

A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy's observations on LLM coding pitfalls.

179,669 18,384 Updated Apr 20, 2026

The best-benchmarked open-source AI memory system. And it's free.

Python 56,106 7,266 Updated Jun 20, 2026

Multi-agent systems, memory, planning, reasoning loops

Jupyter Notebook 2,715 597 Updated Jun 21, 2026

Autoresearch for GPU kernels. Give it any PyTorch model, go to sleep, wake up to optimized Triton kernels.

Python 1,418 143 Updated Mar 19, 2026

autonomous harness engineering

Python 4,501 504 Updated Apr 3, 2026

AI-powered job search system built on Claude Code. 14 skill modes, Go dashboard, PDF generation, batch processing.

JavaScript 55,048 10,901 Updated Jun 21, 2026

Project N.O.M.A.D, is a self-contained, offline survival computer packed with critical tools, knowledge, and AI to keep you informed and empowered—anytime, anywhere.

TypeScript 31,470 3,138 Updated Jun 20, 2026

Inference repo for Falcon-Perception and Falcon-OCR model, early-fusion, natively multimodal, dense Autoregressive Transformer models.

Python 723 68 Updated Apr 27, 2026
Next