1a1a11a

Organizations

@cacheMon

Starred repositories

334 results for source starred repositories

Examples, end-to-end tutorials and apps built using Liquid AI Foundational Models (LFM) and the LEAP SDK

Jupyter Notebook 1,070 161 Updated Feb 4, 2026

🕳 bore is a simple CLI tool for making tunnels to localhost

Rust 10,728 456 Updated Feb 4, 2026

OpenAI API-compatible wrapper for Claude Code

Python 379 61 Updated Jan 6, 2026

DedupBench is a benchmarking tool for content-defined chunking techniques used in data deduplication. It currently supports eleven unique CDC techniques and five different vector instruction sets.

C++ 20 1 Updated Oct 27, 2025

slime is an LLM post-training framework for RL Scaling.

Python 3,669 494 Updated Feb 5, 2026

DAOS Storage Stack (client libraries, storage engine, control plane)

C 912 341 Updated Feb 5, 2026

⚡ Python-free Rust inference server — OpenAI-API compatible. GGUF + SafeTensors, hot model swap, auto-discovery, single binary. FREE now, FREE forever.

Rust 3,643 278 Updated Jan 16, 2026

A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology

C++ 1,340 182 Updated Dec 17, 2025

Python bindings for libCacheSim, designed for rapid experimentation with cache simulation models.

Python 6 2 Updated Oct 23, 2025

A framework for generating realistic LLM serving workloads

Python 99 7 Updated Oct 9, 2025

A single interface to use and evaluate different agent frameworks

Python 1,093 85 Updated Feb 4, 2026

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, load balancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…

Python 35,256 5,636 Updated Feb 5, 2026

Zero-instrumentation LLM and AI agent (e.g. claude code, gemini-cli) observability in eBPF

C 193 27 Updated Nov 21, 2025

A comprehensive open-source cache trace dataset

Jupyter Notebook 20 2 Updated Aug 23, 2025

Lossless codec for numerical data

Rust 455 29 Updated Jan 31, 2026

A high-performance library for building cache simulators

C++ 286 81 Updated Feb 2, 2026

Nano vLLM

Python 11,493 1,521 Updated Nov 3, 2025

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

Python 404 58 Updated Jan 5, 2026

Huawei Cloud datasets

Jupyter Notebook 82 13 Updated Jan 8, 2026

A tool for bandwidth measurements on NVIDIA GPUs.

C++ 618 69 Updated Apr 15, 2025

A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally.

Kotlin 15,077 1,292 Updated Feb 4, 2026

Simple high-throughput inference library

Python 155 10 Updated May 14, 2025

PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for evaluation of training and inference platforms.

Python 156 66 Updated Jan 20, 2026

A tiny yet powerful LLM inference system tailored for research purposes. vLLM-equivalent performance with only 2k lines of code (2% of vLLM).

Python 314 35 Updated Jun 10, 2025

Composable building blocks to build LLM Apps

Python 8,258 1,259 Updated Feb 4, 2026

New file format for storage of large columnar datasets.

C++ 687 63 Updated Feb 4, 2026

Ollama Python library

Python 9,288 906 Updated Jan 23, 2026

A C implementation of the SIEVE cache eviction algorithm, based on the research paper (https://junchengyang.com/publication/nsdi24-SIEVE.pdf)

Makefile 3 Updated Jan 22, 2025