Starred repositories
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
A natural language interface for computers
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
The definitive Web UI for local AI, with powerful features and easy setup.
aider is AI pair programming in your terminal
Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, load balancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…
Best and simplest tool for website change detection, web page monitoring, and website change alerts. Perfect for tracking content changes, price drops, restock alerts, and website defacement monito…
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
Letta is the platform for building stateful agents: open AI with advanced memory that can learn and self-improve over time.
PyGWalker: Turn your dataframe into an interactive UI for visual analysis
LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
An Autonomous LLM Agent for Complex Task Solving
Build effective agents using Model Context Protocol and simple workflow patterns
Home of StarCoder: fine-tuning & inference!
Automatic detection, masking, and inpainting with a detection model.
Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web, vision.
Build databases, automations, apps & agents with AI — no code. Open source platform available on cloud and self-hosted. GDPR, HIPAA, SOC 2 compliant. Best Airtable alternative.
From Images to High-Fidelity 3D Assets with Production-Ready PBR Material
AutoChain: Build lightweight, extensible, and testable LLM Agents
A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.
The way we interact with our data is changing.
[CVPR 2025] MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation
[ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"
Make Llama2 execute code, debug it, save and reuse it, and access the internet
[TVCG'2023] AOT-GAN for High-Resolution Image Inpainting (codebase for image inpainting)