Filip Makraduli fm1320

Master's grad from Imperial College London, ML engineer, very small angel investor, developer relations

29 followers · 10 following

Lists (2)

learning

1 repository

🚀 My stack

1 repository

Stars

bilgeyucel / opensearch-cognee-haystack

⚡ Haystack + OpenSearch + Cognee — hybrid search, graph memory, streaming answers.

Python 5 Updated Jun 9, 2026

mostlygeek / llama-swap

Reliable model swapping for any local OpenAI/Anthropic-compatible server - llama.cpp, vllm, etc.

Go 4,619 351 Updated Jun 16, 2026

tile-ai / tilelang

Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels

Python 6,506 604 Updated Jun 16, 2026

superlinked / sie

Open-source inference server and production cluster for all the models your agent needs.

Python 2,052 183 Updated Jun 16, 2026

vllm-project / vllm-metal

Community maintained hardware plugin for vLLM on Apple Silicon

Python 1,331 158 Updated Jun 16, 2026

m0at / rvllm

rvLLM: High-performance LLM inference in Rust. Drop-in vLLM replacement.

Rust 744 69 Updated Jun 15, 2026

girijesh-ai / ai-interview-codex

Comprehensive ML/AI interview codex with iterative system design, production-ready code, and 2026 standards. Includes LLM/GenAI, RAG systems, agentic AI, and algorithms from scratch.

Python 437 70 Updated Jan 16, 2026

opendataloader-project / opendataloader-pdf

PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.

Java 25,177 2,375 Updated Jun 16, 2026

getlago / inside-lago-voice-skill

A framework for teaching AI to write like you. Not like a better version of you. Like you.

304 44 Updated Apr 13, 2026

MemPalace / mempalace

The best-benchmarked open-source AI memory system. And it's free.

Python 55,735 7,223 Updated Jun 15, 2026

santifer / career-ops

AI-powered job search system built on Claude Code. 14 skill modes, Go dashboard, PDF generation, batch processing.

JavaScript 54,204 10,758 Updated Jun 16, 2026

RyanCodrai / turbovec

A vector index built on TurboQuant, written in Rust with Python bindings

Python 11,768 1,023 Updated Jun 10, 2026

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 83,084 18,126 Updated Jun 16, 2026

MarkoKarbevski / beyond_query_linearity

Code supporting the "Beyond Linearity in Attention Projections: The Case for Nonlinear Queries" paper

Jupyter Notebook 2 1 Updated May 26, 2026

weaviate / weaviate

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …

Go 16,335 1,310 Updated Jun 16, 2026

marimo-team / marimo

A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. Stored as pure Python. All in a modern, AI-native editor.

Python 21,473 1,132 Updated Jun 16, 2026

karpathy / autoresearch

AI agents running research on single-GPU nanochat training automatically

Python 87,182 12,627 Updated Mar 26, 2026

OpenMachine-ai / transformer-tricks

A collection of tricks and tools to speed up transformer models

TeX 207 13 Updated May 6, 2026

HenryNdubuaku / maths-cs-ai-compendium

Become a cracked AI/ML Research Engineer

TypeScript 4,520 626 Updated Jun 14, 2026

slidevjs / slidev

Presentation Slides for Developers

TypeScript 47,217 2,103 Updated Jun 3, 2026

gpu-mode / lectures

Material for gpu-mode lectures

Jupyter Notebook 6,185 623 Updated Jun 15, 2026

vllm-project / aibrix

Cost-efficient and pluggable Infrastructure components for GenAI inference

Go 4,877 601 Updated Jun 16, 2026

ttsugriy / performance-book

Jupyter Notebook 23 2 Updated Feb 13, 2026

lyogavin / airllm

AirLLM 70B inference with single 4GB GPU

Jupyter Notebook 20,063 2,246 Updated Mar 10, 2026

gastownhall / beads

Beads - A memory upgrade for your coding agent

Go 24,567 1,647 Updated Jun 16, 2026

minitap-ai / mobile-use

AI agents can now use real Android and iOS apps, just like a human.

Python 2,602 223 Updated Jun 10, 2026

fullstackwebdev / rlm_repl

Recursive Language Models (RLMs) implementation based on the paper by Zhang, Kraska, and Khattab

Python 239 21 Updated Jan 3, 2026

Shekswess / open-corpus-registry

Open catalog of datasets used to train and align LLMs across pretraining, mid-training, and post-training.

Python 3 Updated Jan 6, 2026

ai-hero-dev / ai-hero

AI Hero's open-source examples and course material. Learn AI Engineering with a single repo.

TypeScript 1,520 292 Updated Jun 8, 2026

micytao / vllm-playground

A modern web interface for managing and interacting with vLLM servers (www.github.com/vllm-project/vllm). Supports both GPU and CPU modes, with special optimizations for macOS Apple Silicon and ent…

JavaScript 472 62 Updated Apr 7, 2026