AI Tools & Prototypes Portfolio

Welcome to my portfolio of Artificial Intelligence tools and prototypes. This repository contains a collection of applications developed during my studies and as personal projects driven by my interest in generative AI and software development.

The projects range from command-line productivity utilities to full-stack web applications, demonstrating practical implementations of Large Language Models (LLMs), Computer Vision, and Audio Processing.

🚀 Featured Project

Smart Document Assistant (RAG)

Tech Stack: Python Streamlit LangChain OpenAI FAISS

A modern web application that solves the problem of information overload. It allows users to create a temporary knowledge base from multiple PDF documents and websites and interact with them using a chat interface.

Key Feature: Retrieval-Augmented Generation (RAG) to ground AI answers in factual data.
Use Case: Rapidly analyzing manuals, contracts, or long reports without reading every page.

🛠️ Project Collection

Here is an overview of the tools included in this repository:

Project	Type	Description	Key Tech
Scientific Article Generator	CLI Tool	Automates the creation of structured academic papers (Abstract to References) on any given topic. Outputs both Markdown and PDF.	Google Gemini xhtml2pdf
Multi-Modal Product Analyzer	CLI Tool	Takes product images as input and generates professional marketing descriptions and slogans.	Google Gemini Vision Argparse
Real-Time Voice Interpreter	Script	A "Universal Translator" that listens to speech, translates it, and speaks it back in the target language.	Whisper GPT-4 OpenAI TTS
Voice-to-Image Artist	Script	Converts spoken descriptions directly into digital art by chaining speech recognition with image generation.	Whisper DALL-E 3
LLM CLI Utility	CLI Tool	A "Swiss Army Knife" for developers to process various file formats (PDF, CSV, HTML) and URLs via command line using AI.	MarkItDown OpenAI

💻 Technologies & Skills

This portfolio demonstrates proficiency in the following areas:

Generative AI: OpenAI API (GPT-4, DALL-E, Whisper), Google Gemini API.
Frameworks: LangChain (Chains, Agents, Vector Stores), Streamlit.
Data Engineering: RAG Architectures, Vector Embeddings (FAISS), Web Scraping.
Python Development: API integration, CLI tool creation, Audio processing (PyAudio), PDF manipulation.

⚙️ Getting Started

Most tools in this repository require API keys (OpenAI or Google Cloud).

Clone the repository:
Install dependencies:
Each project has specific requirements. Generally, you can install the core libraries with:
pip install -r requirements.txt
Environment Setup:
Ensure you have your API keys set as environment variables:
- OPENAI_API_KEY
- GOOGLE_API_KEY (for Gemini projects)

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
llm_cli_utility		llm_cli_utility
product_image_analyzer		product_image_analyzer
rag_document_assistant		rag_document_assistant
real_time_interpreter		real_time_interpreter
scientific_article_generator		scientific_article_generator
voice_to_image_generator		voice_to_image_generator
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AI Tools & Prototypes Portfolio

🚀 Featured Project

Smart Document Assistant (RAG)

🛠️ Project Collection

💻 Technologies & Skills

⚙️ Getting Started

About

Uh oh!

Releases

Packages

Languages

IzmoT/AI_Tools

Folders and files

Latest commit

History

Repository files navigation

AI Tools & Prototypes Portfolio

🚀 Featured Project

Smart Document Assistant (RAG)

🛠️ Project Collection

💻 Technologies & Skills

⚙️ Getting Started

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages