Stars
Compute substrate for AI agents: lightweight enough to live on your laptop, elastic enough to scale into the cloud and unleash unlimited resources.
📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools lik…
Multi-framework streaming Markdown renderers for AI apps: Vue/Nuxt, React/Next.js, Svelte, and Angular, with Mermaid, KaTeX, Shiki, Monaco, safe HTML, and low-jitter updates.
Reliable model swapping for any local OpenAI/Anthropic compatible server - llama.cpp, vllm, etc
llama.cpp fork with additional SOTA quants and improved performance
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
Get your documents ready for gen AI
Context7 Platform -- Up-to-date code documentation for LLMs and AI code editors
🚀 The fast, Pythonic way to build MCP servers and clients.
🤗 smolagents: a barebones library for agents that think in code.
Visual testing tool for MCP servers
⌛ easy to use progress-bar for command-line/terminal applications
Display images in the terminal
⚙️ Convert HTML to Markdown. Even works with entire websites and can be extended through rules.
Concatenate a directory full of files into a single prompt for use with LLMs
A python program that turns an LLM, running on Ollama, into an automated researcher, which will with a single query determine focus areas to investigate, do websearches and scrape content from vari…
Port of OpenAI's Whisper model in C/C++
Use your locally running AI models to assist you in your web browsing
Rembg is a tool to remove images background
A collection of common interactive command line user interfaces.
node.js command-line interfaces made easy
klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs
Parse files (e.g. code repos) and websites to clipboard or a file for ingestions by AI / LLMs
Review/Check GGUF files and estimate the memory usage and maximum tokens per second.
An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.
WebAssembly binding for llama.cpp - Enabling on-browser LLM inference