Stars
RamaLama is an open-source developer tool that simplifies the local serving of AI models from any source and facilitates their use for inference in production, all through the familiar language of …
Reliable model swapping for any local OpenAI-compatible server - llama.cpp, vllm, etc.
Sparse inferencing for transformer-based LLMs
Achieve state-of-the-art inference performance with modern accelerators on Kubernetes
Infinity is a high-throughput, low-latency serving engine for text embeddings, reranking models, CLIP, CLAP, and ColPali
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
World's first AI meeting copilot → The Invisible Companion for Work + Life
Instrument your FastAPI with Prometheus metrics.
Dynamic DNS Server with Web UI written in Go
SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.
letsencrypt/acme client implemented as a shell-script – just add water
DSPy: The framework for programming—not prompting—language models
A framework for few-shot evaluation of language models.
Run the latest LLMs and VLMs across GPU, NPU, and CPU with PC (Python/C++) & mobile (Android & iOS) support, running quickly with OpenAI gpt-oss, Granite4, Qwen3VL, Gemma 3n and more.
A curated list of awesome things related to FastAPI
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
Production-ready platform for agentic workflow development.
A compact LLM pretrained in 9 days using high-quality data
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
The AI-native database built for LLM applications, providing incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text.
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…
Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.
Easily use and train state-of-the-art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease of use, backed by research.