28 Dec 25
ai-engineering-hub/fastest-rag-milvus-groq (patchy631/ai-engineering-hub): in-depth tutorials on LLMs, RAG, and real-world AI agent applications.
This project presents what it calls the fastest stack for building a RAG application, with retrieval latency under 15 ms.
It leverages binary quantization for efficient retrieval coupled with Groq’s blazing fast inference speeds.
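To make the binary quantization idea concrete, here is a minimal sketch in plain NumPy. It does not reproduce the project's own code: the function names `binarize` and `hamming_search`, the sign-based thresholding, and the random test data are all assumptions chosen for illustration. Each float embedding is reduced to one bit per dimension, and retrieval ranks documents by Hamming distance between the packed codes.

```python
import numpy as np


def binarize(vectors: np.ndarray) -> np.ndarray:
    """Quantize float vectors to 1 bit per dimension (sign), packed 8 bits per byte."""
    bits = (vectors > 0).astype(np.uint8)
    return np.packbits(bits, axis=-1)


def hamming_search(query_code: np.ndarray, codes: np.ndarray, k: int) -> np.ndarray:
    """Return indices of the k codes with the smallest Hamming distance to the query."""
    xor = np.bitwise_xor(codes, query_code)               # differing bits, byte-wise
    distances = np.unpackbits(xor, axis=-1).sum(axis=-1)  # count differing bits per row
    return np.argsort(distances, kind="stable")[:k]


# Hypothetical data: 1000 random 128-dimensional "embeddings".
rng = np.random.default_rng(0)
docs = rng.standard_normal((1000, 128)).astype(np.float32)
# A query that is a slightly perturbed copy of document 42.
query = docs[42] + 0.1 * rng.standard_normal(128).astype(np.float32)

doc_codes = binarize(docs)                # shape (1000, 16): 128 bits -> 16 bytes
query_code = binarize(query[None, :])[0]  # shape (16,)
top = hamming_search(query_code, doc_codes, k=5)
print(top)
```

In this sketch a 128-dimensional float32 vector (512 bytes) shrinks to 16 bytes, and the comparison reduces to XOR plus a bit count, which is where the retrieval speedup of this kind of scheme comes from. Production systems typically rescore the top candidates with the original float vectors to recover accuracy; whether the linked project does so is not stated here.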
27 Oct 25
Lessons learned from building RAG systems for Usul AI and enterprise clients, processing over 13 million pages.
24 Oct 25
The rag-cookbooks GitHub repository collects Jupyter notebooks covering advanced techniques for Retrieval-Augmented Generation (RAG) systems, with tutorials, best practices, and practical use cases.