Skip to content
View ljubomirj's full-sized avatar

Block or report ljubomirj

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Do multiple agents sharing a single asymmetrically-compressed KV pool produce quality output comparable to full-precision per-agent KV caches?

Python 8 2 Updated Apr 26, 2026

ROCmfp4

C 10 1 Updated May 25, 2026

NEW ROCmfp4 format for llama.cpp

C++ 95 8 Updated Jun 13, 2026

LLM-compiled knowledge bases for any AI agent. Parallel multi-agent research, thesis-driven investigation, source ingestion, wiki compilation, querying, and artifact generation.

Python 638 76 Updated Jun 14, 2026

A meta-harness for all your AI agents. Omnigent provides a common layer over Claude Code, Codex, Pi, and the agents you write yourself: swap or combine harnesses without rewriting, keep them in che…

Python 1,628 199 Updated Jun 15, 2026

A modern X11 server written from scratch in Rust.

Rust 329 19 Updated Jun 15, 2026

Head-to-head comparison of DeepSeek-V4-Flash vs Step-3.7-Flash on tool-eval-bench v2.0.6 (69 scenarios). Full results, summary, and analysis.

3 1 Updated Jun 13, 2026

Turn any document or a whole zip into an interactive knowledge graph, using a self-hosted Qwen3.6-35B-A3B-MTP on a single NVIDIA L4

Python 164 24 Updated Jun 14, 2026

AI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary

Python 42,708 3,476 Updated Jun 10, 2026

Zonos2 is a leading open-weight text-to-speech MoE.

Python 176 20 Updated Jun 13, 2026

AI agent to evaluate and score resumes.

Python 1,126 348 Updated Jun 3, 2026

A generalist autonomous research agent — runs experiments, researches, and iteratively optimizes, autonomously.

Python 311 38 Updated Jun 15, 2026

[ICLR2026] Official Code for Loopholing Discrete Diffusion: Deterministic Bypass of the Sampling Wall

Python 6 Updated Jun 13, 2026

Official implementation of "Streaming Communication in Multi-Agent Reasoning"

34 1 Updated Jun 6, 2026

A fork of OpenCode for local AI models.

TypeScript 4 Updated Jun 8, 2026

Official implementation of DiscoGen, for "Procedural Generation of Algorithm Discovery Tasks in Machine Learning"

Python 41 8 Updated Jun 9, 2026

Winner 🏆 (Agent-only) MLSys 2026 - FlashInfer AI Kernel Generation Contest for the DeepSeek Sparse Attention (DSA) track with an average speedup of 34.93x

Python 124 10 Updated Jun 10, 2026
Python 78 3 Updated Jun 12, 2026

Loop engineering for agentic software delivery.

TypeScript 280 15 Updated Jun 15, 2026

AI-Driven Scientific and Algorithmic Discovery

Python 553 76 Updated Jun 14, 2026
225 20 Updated Jun 9, 2026

LLM4AD: A Platform for Algorithm Design with Large Language Model

Python 732 93 Updated Apr 15, 2026

Export tweets, bookmarks, lists and much more from Twitter(X) web app. (推文/书签/收藏/列表导出工具)

TypeScript 2,544 188 Updated May 12, 2026

Train the smallest LM you can that fits in 16MB. Best model wins!

Python 5,130 3,340 Updated May 4, 2026

Production LLM inference on the Apple Neural Engine — a practitioner's guide, complete with converters, Swift runtimes, and validated model manifests

Python 32 6 Updated Jun 7, 2026
Python 1,862 310 Updated May 29, 2026

Use your NVIDIA GPU's VRAM as swap space on Linux. Built for laptops with soldered memory and no upgrade path. If you have an RTX card sitting there with 8GB of VRAM and you're getting swapped to S…

Shell 478 12 Updated Jun 12, 2026

Garry's Opinionated OpenClaw/Hermes Agent Brain

TypeScript 22,848 3,268 Updated Jun 14, 2026

This repository contains the code implemented and used to generate all the content of the manuscript submitted to the Nature portfolio journal.

Python 14 3 Updated Jun 9, 2026

Reproduction code for Lattice Deduction Transformers

Python 35 8 Updated Jun 9, 2026
Next