🧠 OpenCluely

Core is working; improvements are shipping daily.

OpenCluely is an AI-powered desktop application that provides invisible, real-time assistance during technical interview rounds.

🎬 Demo Video

OopenCluelyDemo.mp4

🌟 Why OpenCluely?

🥷 100% Stealth Mode

Invisible to Screen Sharing: Zoom, Teams, Meet, Discord
Process Disguise: Appears as a normal system process (Terminal, Activity Monitor, Settings)
Click-Through Windows: Transparent overlay technology
Draggable UI: Move windows anywhere on screen
Zero Detection: Bypasses all recording software

🚀 AI-Powered Intelligence

Direct Image Analysis: Screenshots are analyzed by Gemini (no Tesseract OCR)
Voice Commands: Optional Azure Speech recognition (ask a question quietly by voice, get an instant answer)
Context Memory: Remembers entire interview conversation
Multi-Language Support: C++, Python, Java, JavaScript, C
Smart Response Window: Draggable with close button

🖼️ Modern UI Features

📱 Interactive Windows

Floating Overlay Bar: Compact command center with camera, mic, and skill selector
Draggable Answer Window: Move and resize AI response window anywhere
Close Button: Clean × button to close answer window when needed
Auto-Hide Mic: Microphone button appears only when Azure Speech is configured
Interactive Chat: Full conversation window with markdown support

🎨 Visual Design

Glass Morphism: Beautiful blur effects and transparency
Adaptive Layout: UI adjusts based on available services
Smart Resizing: Windows resize automatically to fit content
Professional Look: Mimics system applications for perfect stealth

🎯 Functional Overview

📋 Core Components

🖱️ Main Overlay

Floating command bar
Screenshot capture (⌘⇧S)
Microphone toggle (Optional)
Skill selector (DSA)
Language picker
Status indicator

💬 Interactive Chat

Real-time transcription
AI conversation
Markdown formatting
Session memory
Listening animations
Auto-scroll messages

📊 Answer Window

Draggable interface
Close button (×)
Split layout for code
Full markdown support
Syntax highlighting
Smart content sizing

✅ To-Do List & Development Status

🎯 Core Features (Completed)

🚧 Planned Features (In Development)

Hidden during screen share (auto‑hide all windows while screen is being shared)
Multi‑model support (OpenAI/Anthropic/Local backends alongside Gemini)
Auto‑typer for code snippets (paste or simulate typing into editors/IDEs)
Export conversation history (save sessions as markdown/PDF)
Performance optimizations (faster startup, reduced memory usage)
Enhanced stealth modes (process name randomization, deeper OS integration)

⚙️ Configuration

The setup script automatically handles configuration. You only need:

```
# Required: Google Gemini API Key (setup script will ask for this)
GEMINI_API_KEY=your_gemini_api_key_here

# Optional: Azure Speech Recognition (add later if you want voice features)
AZURE_SPEECH_KEY=your_azure_speech_key
AZURE_SPEECH_REGION=your_region
```

Note: Speech recognition is completely optional. If Azure credentials are not provided, the microphone button will be automatically hidden from all interfaces.
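
The rule in the note above can be sketched in a few lines. This is a minimal illustration, not the app's actual code: the variable names come from the `.env` example, while `resolveServices` and its return shape are hypothetical.

```javascript
// Hypothetical helper: decide which services are usable from the environment.
// Only the variable names are taken from the README; everything else is assumed.
function resolveServices(env) {
  const has = (name) => typeof env[name] === "string" && env[name].trim() !== "";

  // Gemini is the only required credential.
  if (!has("GEMINI_API_KEY")) {
    throw new Error("GEMINI_API_KEY is required");
  }

  return {
    gemini: true,
    // Voice needs both the key and the region; otherwise the mic stays hidden.
    speech: has("AZURE_SPEECH_KEY") && has("AZURE_SPEECH_REGION"),
  };
}

const services = resolveServices({ GEMINI_API_KEY: "demo-key" });
console.log(services.speech ? "show microphone button" : "hide microphone button");
```

In a real run the argument would presumably be `process.env` after the `.env` file has been loaded.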

🚀 Quick Start & Installation

⚡ Three Simple Steps (All Operating Systems)

1. Clone the repository

   ```
   git clone https://github.com/TechyCSR/OpenCluely.git
   cd OpenCluely
   ```

2. Get your Gemini API key (Required)
   - Visit Google AI Studio
   - Click "Create API Key"
   - Copy the key (you'll need it in step 3)

3. Run the setup script (One command does everything!)

   ```
   ./setup.sh
   ```

That's it! The setup script will:

Install all dependencies automatically
Create and configure your .env file
Build the app (if needed)
Launch OpenCluely, ready to use (if the launch fails, run npm install and then npm start)

💻 Platform-Specific Notes

Windows: Use Git Bash (comes with Git for Windows), WSL, or any bash environment
macOS/Linux: Use your regular terminal
All platforms: No manual npm commands needed - the setup script handles everything

🎛️ Setup Script Options

```
./setup.sh --build                # Build distributable for your OS
./setup.sh --ci                   # Use npm ci instead of npm install
./setup.sh --no-run               # Setup only, don't launch the app
./setup.sh --install-system-deps  # Install sox for microphone (optional)
```

🔧 Optional: Azure Speech Setup (For Voice Features)

Voice recognition is completely optional. The setup script will create a .env file with just the required Gemini key. To add voice features:

Get Azure Speech credentials:
- Visit Azure Portal
- Create a Speech Service
- Copy your key and region

Add to your .env file:

```
# Already configured by setup script
GEMINI_API_KEY=your_gemini_api_key_here

# Add these for voice features (optional)
AZURE_SPEECH_KEY=your_azure_speech_key
AZURE_SPEECH_REGION=your_region
```

Restart the app - microphone buttons will now appear automatically

🎮 How to Use

🖱️ Main Controls

| Action | Shortcut | Description |
| --- | --- | --- |
| Screenshot Capture | `⌘⇧S` | Capture screen and analyze via Gemini (image understanding) |
| Toggle Speech | `Alt+R` | Start/stop voice recognition (if configured) |
| Toggle Visibility | `⌘⇧V` | Show/hide all windows |
| Toggle Interaction | `⌘⇧I` or `Alt+A` | Enable/disable window interaction |
| Switch to Chat | `⌘⇧C` | Open interactive chat window |
| Settings | `⌘,` | Open settings panel |
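
Electron expresses keyboard shortcuts as accelerator strings such as `CommandOrControl+Shift+S`. The sketch below shows one way the symbol notation in the table could be translated into that form. It is an illustration only: `toAccelerator` is a hypothetical helper, and mapping `⌘` to `CommandOrControl` (so that the binding becomes Ctrl on Windows and Linux) is an assumption the README does not confirm.

```javascript
// Hypothetical helper: translate symbol notation (e.g. "⌘⇧S") into an
// Electron-style accelerator string.
// Assumption: ⌘ maps to "CommandOrControl" for cross-platform bindings.
const SYMBOLS = { "⌘": "CommandOrControl", "⇧": "Shift", "⌥": "Alt", "⌃": "Control" };

function toAccelerator(shortcut) {
  // Shortcuts already written as "Alt+R" pass through unchanged.
  if (shortcut.includes("+")) return shortcut;

  const parts = [];
  for (const ch of shortcut) {
    // Known modifier symbols are expanded; any other character is the key itself.
    parts.push(SYMBOLS[ch] ?? ch.toUpperCase());
  }
  return parts.join("+");
}

for (const s of ["⌘⇧S", "Alt+R", "⌘⇧V", "⌘,"]) {
  console.log(`${s} -> ${toAccelerator(s)}`);
}
```

A string produced this way is the kind of value Electron's `globalShortcut.register(accelerator, callback)` accepts in the main process.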

🎯 Workflow

Start OpenCluely → App appears as system process (Terminal/Activity Monitor)
Position Windows → Drag overlay and answer windows to preferred locations
Capture Questions → Use screenshot (⌘⇧S) or voice commands
Get AI Answers → Instant responses in draggable answer window
Interactive Chat → Type or speak for detailed conversations
Stay Stealth → All operations invisible to screen recording

🔧 Advanced Features

🎨 Window Management

Draggable Interface: Click and drag any window to reposition
Auto-resize: Windows automatically adjust to content
Close Button: Click × to close answer window
Always on Top: Windows stay above all applications

🧠 AI Intelligence

Context Awareness: Remembers entire conversation
Code Detection: Automatically formats code blocks
Language Specific: Tailored responses for selected programming language
Session Memory: Maintains context across multiple questions
Image Understanding: DSA prompt is applied only for new image-based queries; chat messages don’t include the full prompt
Multi-monitor & Area Capture: Programmatic APIs allow targeting a display and optional rectangular crop for focused analysis
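
For the area capture item above, a crop rectangle has to stay inside the bounds of the targeted display. The following is a minimal sketch of that clamping step under assumed shapes: `clampCrop` and the `{ x, y, width, height }` objects are hypothetical and are not the project's actual API.

```javascript
// Hypothetical helper: clamp an optional crop rectangle to a display's bounds.
// `display` is { width, height }; `crop` is { x, y, width, height } or undefined.
function clampCrop(display, crop) {
  // No crop requested: capture the whole display.
  if (!crop) return { x: 0, y: 0, width: display.width, height: display.height };

  const clamp = (value, max) => Math.min(Math.max(value, 0), max);

  // Clamp both corners, then derive the size from the clamped corners.
  const x = clamp(crop.x, display.width);
  const y = clamp(crop.y, display.height);
  const right = clamp(crop.x + crop.width, display.width);
  const bottom = clamp(crop.y + crop.height, display.height);

  return { x, y, width: Math.max(right - x, 0), height: Math.max(bottom - y, 0) };
}

console.log(clampCrop({ width: 1920, height: 1080 }, { x: 1800, y: -20, width: 400, height: 200 }));
```

Clamping the corners rather than the width and height keeps the part of the rectangle that is actually on screen, so a crop that spills past an edge is trimmed instead of shifted.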

🔊 Optional Voice Features (Azure Speech)

Real-time Transcription: Speak questions naturally
Listening Animation: Visual feedback during recording
Interim Results: See transcription as you speak
Auto-processing: Instant AI responses to voice input

🧩 Troubleshooting

Setup Issues

setup.sh not found or won't run

Make sure you're in the OpenCluely directory: cd OpenCluely

Make the script executable: chmod +x setup.sh

On Windows, use Git Bash (comes with Git for Windows)

Setup script stops with exit code 130

This means you pressed Ctrl+C. Just run ./setup.sh again

Node or npm not found

Install Node.js 18+ from nodejs.org

Restart your terminal and try again
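
To confirm that the installed runtime meets the 18+ requirement, a quick check along these lines can help. It is a minimal sketch; `meetsMinimumNode` is a hypothetical helper and not part of the setup script.

```javascript
// Hypothetical helper: check a Node.js version string against a minimum major version.
function meetsMinimumNode(version, minimumMajor = 18) {
  // Accept both "18.17.0" and "v18.17.0".
  const major = Number.parseInt(version.replace(/^v/, "").split(".")[0], 10);
  return Number.isInteger(major) && major >= minimumMajor;
}

console.log(
  meetsMinimumNode(process.versions.node)
    ? "Node.js version OK"
    : "Node.js 18+ required"
);
```

Saving this as a file and running it with `node` reports whether the Node.js on your PATH is recent enough.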

App Issues

Electron won't start or shows blank window (Linux)

Try: npm run dev

Ensure X11/XWayland is available if running in headless environments

macOS screen capture doesn't work

Grant "Screen Recording" permission in System Settings → Privacy & Security → Screen Recording

Quit and relaunch the app after granting permission

Windows SmartScreen blocks the app

Click "More info" → "Run anyway" or use npm start during development

Microphone/voice not working

Voice is optional - ignore related warnings if you don't need it

To enable: install sox (Linux/macOS) and add Azure keys to .env

⚖️ Legal & Ethics

📋 Disclaimer

OpenCluely is provided for educational and research purposes. Users are responsible for:

Complying with interview guidelines

Respecting company policies

Understanding legal implications

Using ethically and responsibly

🔒 Privacy

No data collection or telemetry

App logic runs locally; screenshot analysis and optional speech recognition are handled by the Gemini and Azure Speech APIs

API communications are encrypted

Session data stays on your device

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

Google Gemini: Powering AI intelligence
Azure Speech: Optional voice recognition
Electron: Cross-platform desktop framework
Community: Amazing contributors and feedback
Vysper: UI and code structure inspiration — see Vysper by varun-singhh

⭐ Star this repo if OpenCluely helped you ace your interviews or you vibed with it!

Made with ❤️ by TechyCSR
