A modern multimodal AI chat UI that supports text, voice, and image inputs, backed by Ollama for running LLMs locally. Built with React and Gradio (FastAPI optional) and integrated with multiple LLMs (DeepSeek, Mistral, LLaMA3, etc.).
- 🔤 Text input: Ask questions or give instructions in plain text.
- 🎤 Voice input: Upload `.wav` audio or speak directly (speech-to-text using the Google Speech API).
- 🖼️ Image input: Upload image files for OCR using `easyocr`.
- 🤖 Model selection: Choose from available Ollama models dynamically.
- 🗃️ Conversation history: Stored per-session with a `conversation_id` (sketched below).
- 💾 Export chat: Download the chat as JSON.
- 🚀 Streaming replies: Simulated (can be expanded with WebSockets).
- 🧪 Model introspection: Auto-fetch models from `/api/tags` and fall back to defaults (sketched below).
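For the model-introspection feature, a minimal resolver might query Ollama's `/api/tags` endpoint and fall back to a hard-coded list when the daemon is unreachable. A sketch, assuming a default Ollama install on `localhost:11434`; the fallback names are illustrative:

```python
import requests

FALLBACK_MODELS = ["llama3", "mistral", "deepseek-r1"]  # illustrative defaults, not the project's actual list

def resolve_models(base_url: str = "http://localhost:11434") -> list[str]:
    """Fetch locally available model names from Ollama, falling back to defaults."""
    try:
        resp = requests.get(f"{base_url}/api/tags", timeout=3)
        resp.raise_for_status()
        # /api/tags returns {"models": [{"name": "llama3:latest", ...}, ...]}
        models = [m["name"] for m in resp.json().get("models", [])]
        return models or FALLBACK_MODELS
    except requests.RequestException:
        return FALLBACK_MODELS
```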
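Per-session history and the JSON export can be as simple as a dict keyed by `conversation_id`. A minimal sketch; `ChatStore` and its method names are hypothetical, not the project's actual code:

```python
import json
import uuid
from collections import defaultdict

class ChatStore:
    """Hypothetical per-session store keyed by conversation_id."""

    def __init__(self):
        self._history = defaultdict(list)  # conversation_id -> list of message dicts

    def new_conversation(self) -> str:
        """Mint a fresh conversation_id for a new session."""
        return str(uuid.uuid4())

    def append(self, conversation_id: str, role: str, content: str) -> None:
        self._history[conversation_id].append({"role": role, "content": content})

    def export_json(self, conversation_id: str) -> str:
        """Serialize one conversation for the 'Export chat' download."""
        return json.dumps(self._history[conversation_id], indent=2)
```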
| Layer | Tool / Library |
|---|---|
| 💻 Frontend | React + TypeScript + TailwindCSS |
| 🧠 Backend | Python, Gradio, optionally FastAPI or Flask |
| 🧱 UI Components | shadcn/ui, Lucide Icons |
| 📡 API Calls | Gradio client, REST fetch, dynamic model resolver |
| 🔍 OCR | `easyocr` (Python) |
| 🎙️ Voice-to-text | `speech_recognition` using the Google API |
| 🔗 LLMs | Ollama (LLaMA3, Mistral, DeepSeek, Qwen, etc.) |
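The OCR and voice-to-text rows above boil down to two small helpers. A rough sketch, assuming English models and file-path inputs (the paths are placeholders):

```python
import easyocr
import speech_recognition as sr

def image_to_text(image_path: str) -> str:
    """Extract text from an uploaded image with easyocr."""
    reader = easyocr.Reader(["en"])  # downloads detection/recognition models on first run
    return " ".join(reader.readtext(image_path, detail=0))  # detail=0 -> plain strings

def wav_to_text(wav_path: str) -> str:
    """Transcribe an uploaded .wav clip via the Google Speech API."""
    recognizer = sr.Recognizer()
    with sr.AudioFile(wav_path) as source:
        audio = recognizer.record(source)
    return recognizer.recognize_google(audio)
```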
- Install dependencies:

```bash
# Backend (Python)
python -m venv venv
venv\Scripts\activate.bat
pip install gradio easyocr speechrecognition ollama

# Frontend (Node)
npm i
```
- Run App:

```bash
npm start
```
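For orientation, here is a minimal sketch of the Gradio-to-Ollama wiring, assuming a recent Gradio with OpenAI-style message history and that `llama3` has been pulled locally; streaming is simulated by yielding the reply word by word, as in the feature list above:

```python
import gradio as gr
import ollama  # assumes the `ollama` Python package and a running local daemon

MODEL = "llama3"  # assumption: pulled beforehand with `ollama pull llama3`

def respond(message, history):
    # History arrives as OpenAI-style role/content dicts; append the new user turn.
    messages = [{"role": m["role"], "content": m["content"]} for m in history]
    messages.append({"role": "user", "content": message})

    reply = ollama.chat(model=MODEL, messages=messages)["message"]["content"]

    # Simulated streaming: yield the reply progressively, word by word.
    partial = ""
    for word in reply.split():
        partial += word + " "
        yield partial

gr.ChatInterface(respond, type="messages").launch()
```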