-
F9 Research
- Harpenden, UK
- https://ljubomirj.github.io/
- @ljupc0
- @ljupco.bsky.social
- in/ljubomirjosifovski
Stars
Do multiple agents sharing a single asymmetrically-compressed KV pool produce quality output comparable to full-precision per-agent KV caches?
LLM-compiled knowledge bases for any AI agent. Parallel multi-agent research, thesis-driven investigation, source ingestion, wiki compilation, querying, and artifact generation.
A meta-harness for all your AI agents. Omnigent provides a common layer over Claude Code, Codex, Pi, and the agents you write yourself: swap or combine harnesses without rewriting, keep them in che…
Head-to-head comparison of DeepSeek-V4-Flash vs Step-3.7-Flash on tool-eval-bench v2.0.6 (69 scenarios). Full results, summary, and analysis.
Turn any document or a whole zip into an interactive knowledge graph, using a self-hosted Qwen3.6-35B-A3B-MTP on a single NVIDIA L4
AI agent skill that researches any topic across Reddit, X, YouTube, HN, Polymarket, and the web - then synthesizes a grounded summary
Zonos2 is a leading open-weight text-to-speech MoE.
AI agent to evaluate and score resumes.
A generalist autonomous research agent — runs experiments, researches, and iteratively optimizes, autonomously.
[ICLR2026] Official Code for Loopholing Discrete Diffusion: Deterministic Bypass of the Sampling Wall
Official implementation of "Streaming Communication in Multi-Agent Reasoning"
PlunderStruck / opencode
Forked from anomalyco/opencodeA fork of OpenCode for local AI models.
Official implementation of DiscoGen, for "Procedural Generation of Algorithm Discovery Tasks in Machine Learning"
Winner 🏆 (Agent-only) MLSys 2026 - FlashInfer AI Kernel Generation Contest for the DeepSeek Sparse Attention (DSA) track with an average speedup of 34.93x
Loop engineering for agentic software delivery.
AI-Driven Scientific and Algorithmic Discovery
LLM4AD: A Platform for Algorithm Design with Large Language Model
Export tweets, bookmarks, lists and much more from Twitter(X) web app. (推文/书签/收藏/列表导出工具)
Train the smallest LM you can that fits in 16MB. Best model wins!
Production LLM inference on the Apple Neural Engine — a practitioner's guide, complete with converters, Swift runtimes, and validated model manifests
Use your NVIDIA GPU's VRAM as swap space on Linux. Built for laptops with soldered memory and no upgrade path. If you have an RTX card sitting there with 8GB of VRAM and you're getting swapped to S…
Garry's Opinionated OpenClaw/Hermes Agent Brain
This repository contains the code implemented and used to generate all the content of the manuscript submitted to the Nature portfolio journal.
Reproduction code for Lattice Deduction Transformers