Execute your local LLMs directly inside the IDE with zero context switching. Manage, configure, and chat with your Ollama models while maintaining absolute data privacy and sovereignty (your content never leaves your machine).
🎉 Thank you for checking it out! I feel honored that Ollama View has more than 9,000 downloads across marketplaces, before I even had a chance to properly announce it! 🎉
Note: This is an experimental project developed in my free time. While you might encounter some dragons along the way, you are highly encouraged to open issue tickets on GitHub if you find bugs or have feature requests. I'll do my best to take a look when I can!
|
Interactive Chat |
Advanced Model Setup |
You can install this extension from the following sources:
- VS Code Marketplace: Official release.
- Open VSX Registry: For Codium and other editors.
- GitHub Releases: Download the
.vsixfile manually.
- Dynamic Model List: View all your local Ollama models in a hierarchical tree view, grouping instances and chats.
- Model Instances: Create multiple named instances of the same model with distinct configurations.
- Pull Models: Easily download new models (e.g.,
llama3,mistral) via command. - Status Indication: Visual indicators for Running (Green) and Stopped (Grey) models.
- Persistent Chats: Create and manage multiple persistent chats for each model.
- Advanced Setup: Configure model-specific system messages, prompt injection templates (prefix/suffix framing), and performance controls (temperature, seed, top-p, etc.).
- Context Menu: Safe model deletion and configuration access via the "More Actions" menu.
- Concurrent Chats: Open multiple chat tabs simultaneously.
- History: Chat history is automatically saved and persists across VS Code sessions.
- Auto-Start: Sending a message to a stopped model continuously starts it.
- Message Editing: Hover over user messages to copy content or access edit options:
- Edit / Truncate: Edit a message and remove all subsequent history (rewriting the conversation path).
- Edit / Fork: Edit a message and create a new chat branch, preserving the original conversation.
- Model Actions: Hover over model responses to:
- Copy: Copy response content to clipboard.
- Regenerate: Re-generate the answer. If it's not the last message, this will truncate the conversation.
- Fork: Create a new chat branching from this point.
- Export Data: Easily export full chat histories to a JSON file for sharing or backup. Alternatively, chats can be exported to a lighter, human readable text file using the Markdown syntax.
- Prompt Debugging: Visibility toggle to show/hide "hidden" system turns and prompt injections in the chat history.
- Context-Aware Commands: Run Start/Stop/Delete from the Command Palette (
Ctrl+Shift+P) to see a interactive list of models if you haven't selected one in the view.
Ollama View operates entirely locally, ensuring absolute data privacy and sovereignty—your chat history, prompts, and model configurations never leave your machine.
Furthermore, we voluntarily align with the principles of the upcoming European Cyber Resilience Act (CRA). While this remains a personal open-source project provided "as-is", we aim for secure-by-design development practices:
- Zero Telemetry: No tracking, analytics, or external data collection.
- Deep Static Analysis: Our build pipeline utilizes AST-level static security testing (
JS-X-Ray) to detect suspicious code patterns. - Software Composition Analysis: We routinely audit our dependency graph using Google's
OSV-Scannerto intercept vulnerable upstream packages.
If you are interested in how the extension is built or want to contribute, please check out our Contributing Guidelines and Architecture overview.
- Ollama must be installed and running locally.
- By default, it connects to
http://127.0.0.1:11434.
Note: This project has been tested primarily on Linux. It assumes Ollama is already installed and running correctly on your system. Compatibility with other OSs is expected but not yet verified.
ollama-view: Refresh: Refresh the model list.ollama-view: Pull Model: Download a new model.ollama-view: Start: Start a model (context-aware).ollama-view: Stop: Stop a running model (context-aware).ollama-view: Delete: Delete a model (context-aware).
- Autocomplete for model names during "Pull" is limited to a popular subset. You can still type any model name manually.
I'm constantly improving the extension mostly based on my experiments and needs, but feel free to suggest new features or report issues on the GitHub Issues page. My current focus areas for upcoming releases include:
- Chat Import: Complementing our newly released Export feature, this will add capabilities to easily import existing chat histories back into the extension.
- Context Manipulation: Experimenting with ways to allow users to manually override past messages, exploring how altering the historical context shifts the model's follow-up responses.
If you encounter a bug, have a feature request, or just want to discuss an idea, please open a GitHub Issue. Community feedback is essential in driving the direction of this project!
If you find ollama-view useful in your daily workflow, please consider starring the repository on GitHub ⭐️ and reviewing it in the marketplace you downloaded it from. Your support means a lot and helps other users find the project!