EdgeElite

EdgeElite is a real-time, on-device assistant that fills a gap in context-aware productivity tools. It sees what you see (via OCR), hears what you hear (via ASR), and surfaces just-in-time suggestions, giving you fast, private, and personalized support without relying on the cloud.

👩‍💻 Developers

Mansi Garg
Email: mansigar@usc.edu
Aryan Vij
Email: aryanv0213@berkeley.edu
Natalie Tang
Email: nattang@mit.edu
Ruthwika Gajjala
Email: ruthwika11@gmail.com
Brayden Mazepa
Email: braymazepa@gmail.com

⚙️ Setup Instructions

🔧 1. Install Dependencies

📦 Backend (Python 3.8+)

cd backend
python -m venv .venv
source .venv/bin/activate     # On Windows: .venv\Scripts\activate
pip install -r requirements.txt

💻 Frontend (Electron + React)

From the project root:

npm install      # or: yarn / pnpm install

🧠 2. Downloading Models

Place the following models in backend/models/:

➤ OCR Model

Download: EasyOCR
Note: Download both the detector and the recognizer models.

➤ ASR Model

Download: Whisper-Large-V3-Turbo
Note: Download both the encoder and the decoder.
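
Before starting the backend, it can help to confirm that the model files are where the app expects them. The sketch below is a minimal check, assuming the files sit somewhere under backend/models/. The filename fragments ("detector", "recognizer", "encoder", "decoder") are guesses based on the notes above, not the project's actual file names, so adjust them to match your downloads.

```python
from pathlib import Path

# Hypothetical filename fragments; the real names depend on your downloads.
EXPECTED_FRAGMENTS = ("detector", "recognizer", "encoder", "decoder")


def missing_models(models_dir, fragments=EXPECTED_FRAGMENTS):
    """Return the fragments that match no file name under models_dir."""
    root = Path(models_dir)
    if not root.is_dir():
        return list(fragments)
    # Compare case-insensitively against every file in the directory tree.
    names = [p.name.lower() for p in root.rglob("*") if p.is_file()]
    return [f for f in fragments if not any(f in name for name in names)]


if __name__ == "__main__":
    missing = missing_models("backend/models")
    if missing:
        print("Missing model files matching:", ", ".join(missing))
    else:
        print("All expected model files found.")
```

Run it from the project root. An empty result only means that files with matching names exist; it does not verify that they are valid models.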

🚀 Run the Application

▶️ Start the Backend

cd backend
uvicorn main:app --reload --host 0.0.0.0 --port 8000

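This starts the FastAPI server, reachable at http://localhost:8000. Because of --host 0.0.0.0, the server also listens on the machine's other network interfaces.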
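
To confirm the backend came up before launching the frontend, a quick port check is usually enough. This is a minimal sketch that only tests whether something accepts TCP connections on port 8000; the function name backend_is_up is made up for this example and is not part of the project.

```python
import socket


def backend_is_up(host="localhost", port=8000, timeout=1.0):
    """Return True if something accepts TCP connections on host:port."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and name-resolution failures.
        return False


if __name__ == "__main__":
    if backend_is_up():
        print("Backend reachable on port 8000.")
    else:
        print("Backend not reachable on port 8000.")
```

If the app has not disabled FastAPI's default documentation page, http://localhost:8000/docs should also load in a browser once the server is running.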

▶️ Start the Frontend

From the project root:

npm run start

This opens the Electron app with the React UI.

🧠 How to Use EdgeElite

1. Open the app: Launch EdgeElite to begin.
2. Start a session: EdgeElite automatically begins listening through your device’s microphones.
3. Speech recognition: Your locally downloaded ASR (Automatic Speech Recognition) models transcribe your speech in real time and store the results in a searchable local database.
4. Capture your screen: Click the Screenshot button whenever you want to capture visual content from your screen.
5. Optical Character Recognition (OCR): Your locally downloaded OCR models extract text from the screenshot and store it alongside your audio data.
6. Recall past moments: Ask EdgeElite questions about things you've said or seen. It searches its database to retrieve relevant moments from your session history.
7. View & manage results: All recognized content is displayed in the UI. You can save, edit, or export results to other tools.
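
The store-then-recall flow described above can be illustrated with a small local database. The sketch below is not the project's storage layer (the contents of backend/storage/ are not shown here). It assumes a single SQLite table and uses plain substring matching as a stand-in for whatever retrieval EdgeElite actually performs. All table, column, and function names are hypothetical.

```python
import sqlite3
import time


def open_store(path=":memory:"):
    """Create (or open) a SQLite database with a single events table."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS events ("
        "id INTEGER PRIMARY KEY, session TEXT, source TEXT, ts REAL, text TEXT)"
    )
    return conn


def add_event(conn, session, source, text, ts=None):
    """Store one piece of recognized text; source is 'asr' or 'ocr'."""
    conn.execute(
        "INSERT INTO events (session, source, ts, text) VALUES (?, ?, ?, ?)",
        (session, source, time.time() if ts is None else ts, text),
    )
    conn.commit()


def recall(conn, session, query):
    """Return (source, text) rows in the session whose text contains query."""
    rows = conn.execute(
        "SELECT source, text FROM events "
        "WHERE session = ? AND text LIKE ? ORDER BY ts",
        (session, f"%{query}%"),
    )
    return rows.fetchall()


if __name__ == "__main__":
    db = open_store()
    add_event(db, "demo", "asr", "Let's move the launch to Friday", ts=1.0)
    add_event(db, "demo", "ocr", "Launch checklist: QA sign-off", ts=2.0)
    for source, text in recall(db, "demo", "launch"):
        print(source, "->", text)
```

Keeping transcripts and OCR text in one table with a source column lets a single query search both. Note that LIKE treats % and _ in the query as wildcards, so a real implementation would need to escape them or use a different search method.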

🗂 Project Structure

edgeelite/
├── backend/                    # FastAPI backend with AI services
│   ├── main.py                 # API entrypoint and routes
│   ├── asr.py                  # Audio/Speech Recognition
│   ├── llm.py                  # Large Language Model service
│   ├── ocr/                    # Optical Character Recognition
│   ├── storage/                # Data storage and retrieval
│   └── models/                 # AI model files (OCR, ASR, LLM)
├── renderer/                   # Next.js frontend application
├── main/                       # Electron main process
├── captures/                   # Screenshot storage
├── recordings/                 # Audio recording storage
├── docs/                       # Documentation
└── resources/                  # Application resources

📦 Requirements Summary

Component	Toolchain
Backend	Python 3.8+, FastAPI, Uvicorn, ONNX Runtime
Frontend	Node.js 18+, Electron, React
OCR Inference	EasyOCR, Pillow, OpenCV

