Stars
The all-in-one, open-source backend platform for agentic coding. InsForge gives your coding agent database, auth, storage, compute, hosting, and AI gateway to ship full-stack apps end-to-end.
In-depth tutorials on LLMs, RAGs and real-world AI agent applications.
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.6, GPT-OSS, Llama, and more!
OCR model that handles complex tables, forms, handwriting with full layout.
Build, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce.
Unified multimodal backend for AI data apps
An awesome & curated list of best LLMOps tools for developers
Easily fine-tune, evaluate and deploy Gemma 4, Qwen3.5, Qwen3.6, gpt-oss, DeepSeek-R1, or any open source LLM / VLM!
A Kubernetes deployable instance of GroundX for document parsing, storage, and search.
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
The LLM's practical guide: From the fundamentals to deploying advanced LLM and RAG apps to AWS using LLMOps best practices
Command-line program to download videos from YouTube.com and other video sites
50+ tutorials and implementations for Generative AI Agent techniques, from basic conversational bots to complex multi-agent systems.
SwarmZero's SDK for building AI agents, swarms of agents and much more.
Stop renting your intelligence. Own it with AnythingLLM. Everything you need for a powerful local-first agent experience
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. Each technique has a detailed notebook tutorial.
A minimal Python framework for building custom AI inference servers with full control over logic, batching, and scaling.
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
OCR, layout analysis, reading order, table recognition in 90+ languages
A proxy server for multiple ollama instances with Key security
PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily write your own.
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
Python hands on tutorial with 50+ Python Application (10 lines of code) By @xiaowuc2