View fm1320's full-sized avatar


⚡ Haystack + OpenSearch + Cognee — hybrid search, graph memory, streaming answers.

Python 5 Updated Jun 9, 2026

Reliable model swapping for any local OpenAI/Anthropic-compatible server (llama.cpp, vLLM, etc.)

Go 4,619 351 Updated Jun 16, 2026

Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels

Python 6,506 604 Updated Jun 16, 2026

Open-source inference server and production cluster for all the models your agent needs.

Python 2,052 183 Updated Jun 16, 2026

Community maintained hardware plugin for vLLM on Apple Silicon

Python 1,331 158 Updated Jun 16, 2026

rvLLM: High-performance LLM inference in Rust. Drop-in vLLM replacement.

Rust 744 69 Updated Jun 15, 2026

Comprehensive ML/AI interview codex with iterative system design, production-ready code, and 2026 standards. Includes LLM/GenAI, RAG systems, agentic AI, and algorithms from scratch.

Python 437 70 Updated Jan 16, 2026

PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.

Java 25,177 2,375 Updated Jun 16, 2026

A framework for teaching AI to write like you. Not like a better version of you. Like you.

304 44 Updated Apr 13, 2026

The best-benchmarked open-source AI memory system. And it's free.

Python 55,735 7,223 Updated Jun 15, 2026

AI-powered job search system built on Claude Code. 14 skill modes, Go dashboard, PDF generation, batch processing.

JavaScript 54,204 10,758 Updated Jun 16, 2026

A vector index built on TurboQuant, written in Rust with Python bindings

Python 11,768 1,023 Updated Jun 10, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 83,084 18,126 Updated Jun 16, 2026

Code supporting the "Beyond Linearity in Attention Projections: The Case for Nonlinear Queries" paper

Jupyter Notebook 2 1 Updated May 26, 2026

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …

Go 16,335 1,310 Updated Jun 16, 2026

A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. Stored as pure Python. All in a modern, AI-native editor.

Python 21,473 1,132 Updated Jun 16, 2026

AI agents running research on single-GPU nanochat training automatically

Python 87,182 12,627 Updated Mar 26, 2026

A collection of tricks and tools to speed up transformer models

TeX 207 13 Updated May 6, 2026

Become a cracked AI/ML Research Engineer

TypeScript 4,520 626 Updated Jun 14, 2026

Presentation Slides for Developers

TypeScript 47,217 2,103 Updated Jun 3, 2026

Material for gpu-mode lectures

Jupyter Notebook 6,185 623 Updated Jun 15, 2026

Cost-efficient and pluggable Infrastructure components for GenAI inference

Go 4,877 601 Updated Jun 16, 2026
Jupyter Notebook 23 2 Updated Feb 13, 2026

AirLLM 70B inference with single 4GB GPU

Jupyter Notebook 20,063 2,246 Updated Mar 10, 2026

Beads - A memory upgrade for your coding agent

Go 24,567 1,647 Updated Jun 16, 2026

AI agents can now use real Android and iOS apps, just like a human.

Python 2,602 223 Updated Jun 10, 2026

Recursive Language Models (RLMs) implementation based on the paper by Zhang, Kraska, and Khattab

Python 239 21 Updated Jan 3, 2026

Open catalog of datasets used to train and align LLMs across pretraining, mid-training, and post-training.

Python 3 Updated Jan 6, 2026

AI Hero's open-source examples and course material. Learn AI Engineering with a single repo.

TypeScript 1,520 292 Updated Jun 8, 2026

A modern web interface for managing and interacting with vLLM servers (www.github.com/vllm-project/vllm). Supports both GPU and CPU modes, with special optimizations for macOS Apple Silicon and ent…

JavaScript 472 62 Updated Apr 7, 2026