RAG Notebooks - Advanced Retrieval-Augmented Generation Systems

The purpose of this repository is thus to provide a comprehensive and dynamic hub to showcase the latest retrieval augmented generation (RAG) systems, and to enhance their error reduction as well as processing speed and contextual richness. It’s the go-to resource for those looking to become an RAG master — an assembly of curated Jupyter notebooks for learning and teaching RAG concepts. If you’re trying your hand at retrieval-based approaches to improve AI models or are just a fungi out for some advanced RAG methods, this repository is your thing.

Introduction

The combination of generative AI and information retrieval using Retrieval-Augmented Generation (RAG) is a scientific and technological revolution. This repository features a hand-collated set of advanced techniques to greatly enhance your RAG system’s ability to provide more accurate, relevant, and overall system data.

You can find all the course notebooks in the notebooks directory. These notebooks cover various aspects of building and fine-tuning RAG models, providing both theoretical background and practical, hands-on examples.

We aim to create a useful resource for the research and practitioner community to take the RAG to the uttermost extreme. Together we hope to accelerate innovation in this fabulous area by fostering a collaborative environment.

Key Features

🧠 State of the art RAG enhancements. 📚 Each technique has it’s own documentation. 🛠️ Practical implementation guidance. 🌟 Latest advancements, regular updates.

Running the Notebooks

You have two options for running the code in these notebooks:

Run Locally: You can clone the repository and run the notebooks on your local machine. To do this, ensure you have a Python installation with the necessary dependencies.

Run on Google Colab: Each notebook includes a link at the top to open it directly in Google Colab, making it easy to run without local setup.

01-Basic_Tutor.ipynb - Introduction to RAG fundamentals
02-Basic_RAG.ipynb - Basic RAG implementation
03-RAG_with_LlamaIndex.ipynb - Using LlamaIndex in RAG systems
04-RAG_with_VectorStore.ipynb - Integrating vector stores for enhanced retrieval
05-Improve_Prompts_+_Add_Source.ipynb - Refining prompts and adding source attribution
06-Evaluate_RAG.ipynb - Evaluating RAG performance
07-RAG_Improve_Chunking.ipynb - Optimizing text chunking
08-Finetune_Embedding.ipynb - Fine-tuning embeddings
10-Adding_Reranking.ipynb - Adding reranking mechanisms
11-Adding_Hybrid_Search.ipynb - Implementing hybrid search techniques
12-Improve_Query.ipynb - Improving query quality
13-Adding_Router.ipynb - Adding routing mechanisms
14-Adding_Chat.ipynb - Incorporating chat functionalities
15-Use_OpenSource_Models.ipynb - Using open-source models in RAG
17-Using_LLMs_to_rank_chunks_as_the_Judge.ipynb - Ranking chunks with LLMs
Advanced_Retriever.ipynb - Advanced retrieval methods
Agents_with_OpenAI_Assistants.ipynb - Using agents with OpenAI assistants
Audio_and_Realtime.ipynb - Audio and real-time processing
Basic_Agent_Example.ipynb - Example of a basic agent
Cohere_Model - Using Cohere model
Open source Cohere model- Using open source Cohere model
Website Crawl- Crawl a website using RAG
Dalle + Elevenlabs- Image and Voice generation
Evaluating_and_Iterating_Prompts- Evaluate and Prompt Iteration
GPT_4o_mini_Fine_Tuning- Fine tunning GPT4o
GraphRAG_Implementation- All about GraphRag
HF_Inference- Using HF_Inference
Knowledge_Base_for_RAG- Knowledge_Base_for_RAG
Larger_Context_Larger_N- Larger_Context model
Limitations_and_weaknesses_of_LLMs- Limitations_and_weaknesses_of_LLMs
LlamaIndex_101-LlamaIndex
LlamaParse- LlamaParse
Long_Context_Caching_vs_RAG- Long_Context_Caching_vs_RAG
Metadata_Filtering- Data Filtering
More_Api_And_Tools- Multiple tools of RAG
Observablity_And_Tracing- Observation of workflow with tracking
Open_source_BetterEmbedding_Model- Open_source_BetterEmbedding_Model
Perplexity_Web_Api- Using Perplexity
Prompting_101- Prompting
RAG_101- RAG
Structured(JSON)_PDF_Data_Extraction - PDF extraction
Web_Search_API- Using Web api for search

About This Repository

- Audience: Designed for students and professionals interested in AI and natural language processing.

- Topics Covered: The notebooks cover foundational and advanced concepts in retrieval-augmented generation, including:

Data retrieval techniques
Model integration with retrieval systems
Practical applications of RAG in real-world scenarios

Getting Started

Clone the repository and explore the notebooks at your own pace. Whether running them locally or in Colab, these notebooks will guide you step-by-step, enhancing your learning experience.

Contributing

Contributions are welcome! If you have suggestions for improvements or new notebooks, please open an issue or submit a pull request.

License

This project is licensed under the Apache-2.0 License. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 53 Commits
Images		Images
01-Basic_Tutor.ipynb		01-Basic_Tutor.ipynb
02-Basic_RAG.ipynb		02-Basic_RAG.ipynb
03-RAG_with_LlamaIndex.ipynb		03-RAG_with_LlamaIndex.ipynb
04-RAG_with_VectorStore.ipynb		04-RAG_with_VectorStore.ipynb
05-Improve_Prompts_+_Add_Source.ipynb		05-Improve_Prompts_+_Add_Source.ipynb
06-Evaluate_RAG.ipynb		06-Evaluate_RAG.ipynb
07-RAG_Improve_Chunking.ipynb		07-RAG_Improve_Chunking.ipynb
08-Finetune_Embedding.ipynb		08-Finetune_Embedding.ipynb
10-Adding_Reranking.ipynb		10-Adding_Reranking.ipynb
11-Adding_Hybrid_Search.ipynb		11-Adding_Hybrid_Search.ipynb
12-Improve_Query.ipynb		12-Improve_Query.ipynb
13-Adding_Router.ipynb		13-Adding_Router.ipynb
14-Adding_Chat.ipynb		14-Adding_Chat.ipynb
15-Use_OpenSource_Models.ipynb		15-Use_OpenSource_Models.ipynb
17-Using_LLMs_to_rank_chunks_as_the_Judge.ipynb		17-Using_LLMs_to_rank_chunks_as_the_Judge.ipynb
Advanced_Retriever.ipynb		Advanced_Retriever.ipynb
Agents_with_OpenAI_Assistants.ipynb		Agents_with_OpenAI_Assistants.ipynb
Audio_and_Realtime.ipynb		Audio_and_Realtime.ipynb
Basic_Agent_Example.ipynb		Basic_Agent_Example.ipynb
Cohere_Better_Embedding_Model.ipynb		Cohere_Better_Embedding_Model.ipynb
Cohere_and_Open_Source_Embedding_Model.ipynb		Cohere_and_Open_Source_Embedding_Model.ipynb
Crawl_a_Website.ipynb		Crawl_a_Website.ipynb
DallE_3_and_ElevenLabs.ipynb		DallE_3_and_ElevenLabs.ipynb
Evaluating_and_Iterating_Prompts.ipynb		Evaluating_and_Iterating_Prompts.ipynb
GPT_4o_mini_Fine_Tuning.ipynb		GPT_4o_mini_Fine_Tuning.ipynb
GraphRAG_Implementation.ipynb		GraphRAG_Implementation.ipynb
HF_Inference.ipynb		HF_Inference.ipynb
Knowledge_Base_for_RAG.ipynb		Knowledge_Base_for_RAG.ipynb
LICENSE		LICENSE
Larger_Context_Larger_N.ipynb		Larger_Context_Larger_N.ipynb
Limitations_and_weaknesses_of_LLMs.ipynb		Limitations_and_weaknesses_of_LLMs.ipynb
LlamaIndex_101.ipynb		LlamaIndex_101.ipynb
LlamaParse.ipynb		LlamaParse.ipynb
Long_Context_Caching_vs_RAG.ipynb		Long_Context_Caching_vs_RAG.ipynb
Metadata_Filtering.ipynb		Metadata_Filtering.ipynb
More_Api_And_Tools.ipynb		More_Api_And_Tools.ipynb
Observablity_And_Tracing.ipynb		Observablity_And_Tracing.ipynb
Open_source_BetterEmbedding_Model.ipynb		Open_source_BetterEmbedding_Model.ipynb
Perplexity_Web_Api.ipynb		Perplexity_Web_Api.ipynb
Prompting_101.ipynb		Prompting_101.ipynb
RAG_101.ipynb		RAG_101.ipynb
README.md		README.md
Structured(JSON)_PDF_Data_Extraction.ipynb		Structured(JSON)_PDF_Data_Extraction.ipynb
Web_Search_API.ipynb		Web_Search_API.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

RAG Notebooks - Advanced Retrieval-Augmented Generation Systems

Introduction

Key Features

Running the Notebooks

Table of Contents

About This Repository

- Audience: Designed for students and professionals interested in AI and natural language processing.

- Topics Covered: The notebooks cover foundational and advanced concepts in retrieval-augmented generation, including:

Getting Started

Contributing

License

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

Aadya-Madankar/RAG_Notebooks

Folders and files

Latest commit

History

Repository files navigation

RAG Notebooks - Advanced Retrieval-Augmented Generation Systems

Introduction

Key Features

Running the Notebooks

Table of Contents

About This Repository

- Audience: Designed for students and professionals interested in AI and natural language processing.

- Topics Covered: The notebooks cover foundational and advanced concepts in retrieval-augmented generation, including:

Getting Started

Contributing

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages