linkhut

In-depth tutorials on LLMs, RAGs and real-world AI agent applications. - ai-engineering-hub/fastest-rag-milvus-groq at main · patchy631/ai-engineering-hub

This project puts together the fastest stack for building a RAG application, with retrieval latency under 15 ms.

It pairs binary quantization, for efficient retrieval, with Groq’s blazing-fast inference speeds.
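
For readers unfamiliar with binary quantization, here is a minimal, generic sketch of the idea: each embedding dimension is reduced to a single sign bit, and candidates are ranked by Hamming distance between the resulting codes. It is not taken from the linked project. The toy vectors, the document ids, and the helper names `binarize`, `hamming`, and `search` are all assumptions made for illustration, and a real setup would rely on a vector database's binary index rather than a Python loop.

```python
from typing import List, Sequence, Tuple


def binarize(vector: Sequence[float]) -> int:
    """Pack the sign of each dimension into one integer (1 bit per dimension)."""
    bits = 0
    for value in vector:
        bits = (bits << 1) | (1 if value > 0 else 0)
    return bits


def hamming(a: int, b: int) -> int:
    """Count the bit positions where two binary codes differ."""
    return bin(a ^ b).count("1")


def search(query: Sequence[float], index: List[Tuple[str, int]], k: int = 2) -> List[str]:
    """Return the ids of the k codes closest to the query by Hamming distance."""
    q = binarize(query)
    ranked = sorted(index, key=lambda item: hamming(q, item[1]))
    return [doc_id for doc_id, _ in ranked[:k]]


# Hypothetical 8-dimensional "embeddings"; real ones come from an embedding model.
docs = {
    "doc_a": [0.9, -0.2, 0.4, -0.7, 0.1, 0.3, -0.5, 0.8],
    "doc_b": [-0.6, 0.5, -0.3, 0.2, -0.9, -0.1, 0.7, -0.4],
    "doc_c": [0.8, -0.1, 0.5, -0.6, -0.2, 0.4, -0.3, 0.6],
}
index = [(doc_id, binarize(vec)) for doc_id, vec in docs.items()]

query = [0.7, -0.3, 0.6, -0.5, 0.2, 0.1, -0.4, 0.9]
top = search(query, index, k=2)
print(top)  # ['doc_a', 'doc_c']
```

Each vector shrinks from one float per dimension to one bit per dimension, and the comparison becomes an XOR plus a bit count, which is where the retrieval speedup usually comes from. The trade-off is some loss of ranking precision, so binary search is often followed by a rescoring step on the full-precision vectors.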

by tmfnk 1 month ago

Tags:

27 Oct 25

Production RAG: what I learned from processing 5M+ documents

https://blog.abdellatif.io/production-rag-processing-5m-documents

Lessons learned from building RAG systems for Usul AI and enterprise clients, processing over 13 million pages.

by tmfnk 3 months ago

Tags:

24 Oct 25

athina-ai/rag-cookbooks: This repository contains various advanced techniques for Retrieval-Augmented Generation (RAG) systems.

https://github.com/athina-ai/rag-cookbooks

The rag-cookbooks GitHub repository provides a collection of Jupyter notebooks offering tutorials, best practices, and practical use cases for implementing Retrieval-Augmented Generation (RAG) systems.

by tmfnk 3 months ago

Tags:

19 Jun 25

Using the Azure CosmosDB NoSQL Vector Store connector

https://learn.microsoft.com/en-us/semantic-kernel/concepts/vector-store-connectors/out-of-the-box-connectors/azure-cosmosdb-nosql-connector?pivots=programming-language-csharp

https://github.com/hugobarona/banking-multi-agent-workshop/blob/main/csharp/src/MultiAgentCopilot/Models/Banking/OfferTerm.cs#L5
https://github.com/PennStateLefty/semantic-kernel/tree/8232075e5c73f1827514ee583b3f5104ddf087d7/dotnet/src/VectorDataIntegrationTests/CosmosNoSqlIntegrationTests/CRUD
https://github.com/seetampradhan/CosmosVectorSearch/tree/main#

by ciwchris 7 months ago

Tags:

12 May 25

ContextGem: Effortless LLM extraction from documents

https://github.com/shcherbak-ai/contextgem

by ciwchris 8 months ago

Tags:

rag

15 Mar 25

A powerful AI-powered research assistant that performs deep, iterative analysis using multiple LLMs and web searches

https://github.com/LearningCircuit/local-deep-research

by ciwchris 10 months ago

Tags:

08 Mar 25

RLAMA: A powerful document question-answering tool that connects to your local Ollama models. Create, manage, and interact with RAG systems for all your document needs.

https://rlama.dev/

by ciwchris 11 months ago

Tags:

23 Feb 25

The TypeScript AI framework - Mastra

https://mastra.ai/

Prototype and productionize AI features with a modern JS/TS stack

by chrisSt 11 months ago

Tags:

26 Jan 25

Anthropic’s new Citations API

https://simonwillison.net/2025/Jan/24/anthropics-new-citations-api/#atom-entries

by ciwchris 1 year ago saved 2 times

Tags:

02 Jan 25

An open-source RAG-based tool for chatting with your documents.

https://github.com/Cinnamon/kotaemon

by ciwchris 1 year ago saved 2 times

Tags:

ai
rag

18 Dec 24

A Personal NotebookLM and Perplexity-like AI Assistant for Everyone.

https://www.surfsense.net

by ciwchris 1 year ago

Tags:

03 Oct 24

A tool for chatting with your documents

https://quilt.fly.dev/

by ciwchris 1 year ago

Tags:

30 Jul 24

Turn any website into a knowledge base for LLMs

https://www.embedding.io

by ciwchris 1 year ago

Tags:

ai
rag

03 Jul 24

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

https://github.com/adithya-s-k/omniparse

by ciwchris 1 year ago

Tags:

ai
rag

30 May 24

What We Learned from a Year of Building with LLMs (Part I) – O’Reilly

https://www.oreilly.com/radar/what-we-learned-from-a-year-of-building-with-llms-part-i/

by chrisSt 1 year ago

Tags:

02 May 24

Jina: URL to Markdown

https://jina.ai/reader/

by vivo50 1 year ago saved 3 times

Tags:

10 Apr 24

Try out the Dot beta

https://dotapp.uk/

This is Dot, a standalone open-source app meant for easy use of local LLMs, and RAG in particular, to interact with documents and files, similar to Nvidia’s Chat with RTX. Dot is completely standalone and is packaged with all dependencies, including a copy of Mistral 7B. This makes the app as accessible as possible: no prior knowledge of programming or local LLMs is required to use it.

by chrisSt 1 year ago

Tags:

15 Jan 24

vanna-ai/vanna: 🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.

https://github.com/vanna-ai/vanna

🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.

by chrisSt 2 years ago