28 Dec 25

In-depth tutorials on LLMs, RAG, and real-world AI agent applications. Repository: patchy631/ai-engineering-hub, folder fastest-rag-milvus-groq.

This project presents itself as the fastest stack for building a RAG application, with retrieval latency under 15 ms.

It leverages binary quantization for efficient retrieval coupled with Groq’s blazing fast inference speeds.
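
As a rough illustration of why binary quantization speeds up retrieval, here is a minimal sketch in plain Python. It is not the repository's code and does not use the Milvus or Groq APIs; the function names (`binarize`, `hamming`, `search`) and the toy 4-dimensional vectors are hypothetical. The idea shown is that each float embedding is reduced to one bit per dimension (its sign), so similarity search becomes a cheap XOR and bit count rather than floating-point math.

```python
from typing import List, Sequence, Tuple


def binarize(vec: Sequence[float]) -> int:
    """Pack a float vector into an int: bit i is set when vec[i] > 0."""
    bits = 0
    for i, x in enumerate(vec):
        if x > 0:
            bits |= 1 << i
    return bits


def hamming(a: int, b: int) -> int:
    """Count the bit positions where two binary codes differ."""
    return bin(a ^ b).count("1")


def search(query: Sequence[float], index: List[int], k: int = 3) -> List[Tuple[int, int]]:
    """Return the k closest codes as (hamming_distance, doc_id) pairs."""
    q = binarize(query)
    scored = [(hamming(q, code), doc_id) for doc_id, code in enumerate(index)]
    return sorted(scored)[:k]


# Toy embeddings (hypothetical values); real ones have hundreds of dimensions.
docs = [
    [0.9, -0.2, 0.4, -0.7],
    [-0.5, 0.3, -0.1, 0.8],
    [0.7, -0.1, 0.2, -0.3],
]
index = [binarize(d) for d in docs]
top = search([0.8, -0.3, 0.5, -0.6], index, k=2)
print(top)
```

The trade-off is precision: documents 0 and 2 collapse to the same code here, which is why binary search is often followed by a rescoring pass over the original float vectors.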

by tmfnk 1 month ago

27 Oct 25

Lessons learned from building RAG systems for Usul AI and enterprise clients, processing over 13 million pages.

by tmfnk 3 months ago

24 Oct 25

The rag-cookbooks GitHub repository collects Jupyter notebooks covering advanced techniques for Retrieval-Augmented Generation (RAG) systems, with tutorials, best practices, and practical use cases.

by tmfnk 3 months ago