Skip to content
View b5y's full-sized avatar
🇫🇮
Working from home
🇫🇮
Working from home

Organizations

@elfi-dev

Block or report b5y

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Self-hosted AI workspace.

Python 74,566 9,623 Updated Jun 19, 2026

Perplexity open source garden for inference technology

Rust 581 56 Updated May 27, 2026

DiffusionBlocks: Block-wise Neural Network Training via Diffusion Interpretation

Python 228 23 Updated Feb 18, 2026

OSCAR: Offline Spectral Covariance-Aware Rotation for 2-bit KV Cache Quantization

Python 533 74 Updated Jun 17, 2026

A simple SWE style browser agent framework that achieves SOTA results on long horizon web tasks.

Python 5,520 344 Updated Jun 3, 2026

1 place to call all your agents - OpenCode, Hermes, Claude Managed Agents, Cursor Agents API, DeepAgents.

Rust 964 100 Updated Jun 18, 2026

Fastino's LLM guardrail

41 5 Updated May 12, 2026

Cuda kernels for leveraging LLM sparsity to improve throughput and decrease the memory requirements during inference and training.

Cuda 245 23 Updated May 14, 2026

Incremental engine for long horizon agents 🌟 Star if you like it!

Rust 10,420 811 Updated Jun 20, 2026

cuda-oxide is an experimental Rust-to-CUDA compiler that lets you write (SIMT) GPU kernels in safe(ish), idiomatic Rust. It compiles standard Rust code directly to PTX — no DSLs, no foreign languag…

Rust 2,795 188 Updated Jun 19, 2026

TokenSpeed is a speed-of-light LLM inference engine.

Python 1,464 161 Updated Jun 20, 2026

Polymarket Data Retriever that fetches, processes, and structures Polymarket data including markets, order events and trades.

Python 2,161 420 Updated May 27, 2026

talkie is a vintage language model from 1930

Python 911 56 Updated May 19, 2026

1K resolution vision transformers pretrained on 1B human images.

Python 808 52 Updated May 24, 2026

Bring agents to any interfaces

TypeScript 791 102 Updated Jun 19, 2026

TriAttention — Efficient long reasoning with trigonometric KV cache compression. Enables OpenClaw local deployment on memory-constrained GPUs.

Python 789 77 Updated Jun 18, 2026

NVIDIA AITune is an inference toolkit designed for tuning and deploying Deep Learning models with a focus on NVIDIA GPUs.

Python 275 31 Updated Jun 3, 2026

Dimensional is the agentic operating system for physical space. Vibecode humanoids, quadrupeds, drones, and other hardware platforms in natural language and build multi-agent systems that work seam…

Python 3,526 702 Updated Jun 20, 2026

Efficient Universal Perception Encoder: a single on-device vision encoder with versatile representations that match or exceed specialized experts across multiple task domains.

Python 666 38 Updated Apr 14, 2026

A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy's observations on LLM coding pitfalls.

178,956 18,292 Updated Apr 20, 2026

The best-benchmarked open-source AI memory system. And it's free.

Python 56,040 7,258 Updated Jun 19, 2026

Multi-agent systems, memory, planning, reasoning loops

Jupyter Notebook 2,711 596 Updated Jun 20, 2026

Autoresearch for GPU kernels. Give it any PyTorch model, go to sleep, wake up to optimized Triton kernels.

Python 1,418 143 Updated Mar 19, 2026

autonomous harness engineering

Python 4,500 504 Updated Apr 3, 2026

AI-powered job search system built on Claude Code. 14 skill modes, Go dashboard, PDF generation, batch processing.

JavaScript 54,807 10,866 Updated Jun 18, 2026

Project N.O.M.A.D, is a self-contained, offline survival computer packed with critical tools, knowledge, and AI to keep you informed and empowered—anytime, anywhere.

TypeScript 31,321 3,113 Updated Jun 20, 2026

Inference repo for Falcon-Perception and Falcon-OCR model, early-fusion, natively multimodal, dense Autoregressive Transformer models.

Python 723 68 Updated Apr 27, 2026

A Simple and Universal Swarm Intelligence Engine, Predicting Anything. 简洁通用的群体智能引擎,预测万物

Python 66,835 10,421 Updated May 24, 2026
Next