The dataset used in this project can be found on Kaggle: President Biden Speech Dataset.
The goal is to develop a Q&A system capable of delivering informed responses on the content of President Biden's address, demonstrating RAG's application in combining semantic data retrieval with generative language models for NLP analysis in political discourse.
- Llama 2.0: Employs for generating text responses that mimic human conversation.
- ChromaDB: Acts as a vector database for storing and retrieving speech data semantically.
- LangChain: Orchestrates the interaction between Llama 2.0 and ChromaDB, facilitating the RAG process.
The system retrieves information from ChromaDB based on user queries about the SOTU address, then uses Llama 2.0 to generate responses. This ensures accuracy and broader contextual understanding, resulting in precise and enriched answers.
Ideal for researchers, journalists, educators, and the public for exploring political speeches via natural language queries. It showcases how retrieval and generation can be integrated in NLP to improve information access and understanding.
To get started with this project:
- Clone this repository to your local machine.
- Ensure you have Jupyter Notebook installed and running.
- Install the required dependencies.
- Download the "President Biden Speech Dataset" and place it in the designated directory.
- Open and run the Jupyter Notebook "RAG-BidenSpeech.ipynb" to train and evaluate the model.
We welcome contributions to enhance the functionality and efficiency of this script. Feel free to fork, modify, and make pull requests to this repository. To contribute:
- Fork the Project.
- Create your Feature Branch (
git checkout -b feature/AmazingFeature
). - Commit your Changes (
git commit -m 'Add some AmazingFeature'
). - Push to the Branch (
git push origin feature/AmazingFeature
). - Open a Pull Request against the
main
branch.
This project is licensed under the MIT License - see the LICENSE
file for details.
Author: Akhil Chhibber
LinkedIn: https://www.linkedin.com/in/akhilchhibber/
Medium Blogs: https://medium.com/@akhil.chibber