iVA is a real-time web application that processes a live camera feed to perform object detection, text extraction (OCR), and AI-driven contextual analysis. It features a dual-model system, allowing users to switch between general object detection and specific, text-based object searching.
- Dual Detection Modes:
- YOLOv8: For high-performance, general-purpose object detection (80 classes).
- Grounding DINO (Placeholder): For specific, "open-vocabulary" detection based on a user's text prompt (e.g., "a person wearing a hat").
- Real-time Bounding Boxes: Draws boxes and labels around detected objects directly on the video feed.
- Text Extraction (OCR): Uses Tesseract to read text visible in the video stream.
- AI Scene Description: Leverages the Google Gemini API to generate intelligent descriptions for important scenes.
- Optimized Asynchronous Logging: A background worker intelligently buffers analysis results to a temporary log file, then selects and enriches only the most "reliable" log from each time window to save to a SQL Server database, minimizing API costs and database load.
- Interactive UI: A clean interface with controls to pause/play the video feed and switch between detection modes.
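The log-selection step of the background worker can be sketched as follows. This is an illustrative sketch only: the `AnalysisLog` record and `LogSelector` class are hypothetical names, not the project's actual types, and "reliability" is assumed here to mean highest detection confidence within the buffered window.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Hypothetical shape of one buffered analysis result.
public record AnalysisLog(DateTime Timestamp, string Label, double Confidence);

public static class LogSelector
{
    // From a window of buffered results, keep only the single most
    // "reliable" entry (here approximated as highest confidence);
    // returns null when the window is empty.
    public static AnalysisLog? SelectMostReliable(IEnumerable<AnalysisLog> window)
        => window.OrderByDescending(l => l.Confidence).FirstOrDefault();
}

public class Program
{
    public static void Main()
    {
        var window = new List<AnalysisLog>
        {
            new(DateTime.UtcNow, "person", 0.62),
            new(DateTime.UtcNow, "person", 0.91),
            new(DateTime.UtcNow, "cup",    0.40),
        };

        var best = LogSelector.SelectMostReliable(window);
        Console.WriteLine(best!.Label); // the 0.91-confidence "person" entry wins
    }
}
```

Only the selected entry would then be enriched (e.g., with a Gemini description) and written to SQL Server, which is what keeps API and database costs proportional to the number of time windows rather than the number of frames.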
- Backend:
- ASP.NET Core (.NET 9)
- C#
- Entity Framework Core
- SQL Server
- Microsoft.ML.OnnxRuntime (for YOLOv8 inference)
- Google.Ai.Generativelanguage (official Gemini SDK)
- Tesseract.NET (for OCR)
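For reference, these dependencies would appear in the project file roughly as below. The `Version` values are placeholders, not pinned by this README, and the exact NuGet package IDs (e.g., the Tesseract wrapper) should be confirmed against the repository's .csproj.

```xml
<!-- Illustrative package references; versions are placeholders. -->
<ItemGroup>
  <PackageReference Include="Microsoft.ML.OnnxRuntime" Version="x.y.z" />
  <PackageReference Include="Google.Ai.Generativelanguage" Version="x.y.z" />
  <PackageReference Include="Tesseract" Version="x.y.z" />
</ItemGroup>
```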
- Frontend:
- HTML5
- Tailwind CSS
- Vanilla JavaScript
- .NET 9 SDK
- SQL Server (e.g., Express or Developer edition)
You must place the following files and folders in the root directory of the C# project:
- `yolov8n.onnx`: The YOLOv8 model file.
- `tessdata` folder: This folder must contain the Tesseract language data.
  - Create a folder named `tessdata`.
  - Download `eng.traineddata` from here and place it inside the `tessdata` folder.
(The .csproj file is already configured to copy these files to the output directory when you build the project.)
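The copy-to-output behavior mentioned above typically looks like the fragment below. This is a sketch of the usual MSBuild pattern, not a copy of the project's actual .csproj, which already contains its own equivalent.

```xml
<!-- Typical copy-to-output configuration (illustrative). -->
<ItemGroup>
  <None Include="yolov8n.onnx" CopyToOutputDirectory="PreserveNewest" />
  <None Include="tessdata\**\*" CopyToOutputDirectory="PreserveNewest" />
</ItemGroup>
```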
- Open the `appsettings.json` file.
- Update the `ConnectionStrings.DefaultConnection` value to point to your SQL Server instance.
- Update the `Gemini.ApiKey` value with your Google Gemini API key.
- Open a terminal in the project root and run the database migration to create the necessary tables:
dotnet ef database update
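The relevant shape of `appsettings.json` is sketched below. The server name, database name, and connection options are placeholders for a local SQL Server Express setup; substitute your own instance details and key.

```json
{
  "ConnectionStrings": {
    "DefaultConnection": "Server=localhost\\SQLEXPRESS;Database=iVA;Trusted_Connection=True;TrustServerCertificate=True;"
  },
  "Gemini": {
    "ApiKey": "YOUR_GEMINI_API_KEY"
  }
}
```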
- Open a terminal in the project's root directory.
- Run the application using the .NET CLI:
dotnet run
- The terminal will display the URL the application is running on (e.g., `https://localhost:7123`). Open this URL in your web browser.
- When the page loads, your browser will ask for permission to use your camera. Click Allow.
- The application will start in the default "General Detection (YOLO)" mode, automatically identifying and boxing common objects.
- To search for a specific object, select the "Specific Search (DINO)" radio button and type a description (e.g., "a blue cup") into the text box.
- Use the Pause and Play buttons to control the video feed and the analysis process.
- The analytics panel on the right will update with the results from the live analysis. The background worker will save the most relevant logs to the database every 10 seconds.