Research Assistant @ MBZUAI Β· Incoming M.S. Student Β· B.Tech ECE, VIT Vellore
Building vision-language models for geospatial understanding
I am a researcher working at the intersection of computer vision, vision-language models, and geospatial AI. Currently a Research Assistant at the Mohamed Bin Zayed University of Artificial Intelligence (MBZUAI), where I am building Tera-SAM β an agentic segmentation pipeline for zero-shot instance segmentation of satellite imagery using SAM3 and VLMs.
I completed my B.Tech in Electronics and Communication Engineering from VIT Vellore (GPA: 8.29) and am preparing to begin graduate studies. My work spans the full stack from model design to edge deployment on Raspberry Pi and NVIDIA Jetson hardware.
Current focus β VLMs for geospatial segmentation Β· Agentic AI Β· Remote sensing
Past work β IoT security Β· Video restoration Β· Edge CV Β· Assistive tech
| Period | Role | Institution |
|---|---|---|
| Dec 2025 β Present | Research Assistant | Mohamed Bin Zayed University of AI (MBZUAI) |
| May 2025 β Sep 2025 | Research Intern | Indian Institute of Technology, Guwahati |
| Jan 2024 β Aug 2024 | Research Intern | National University of Singapore |
| Jun 2024 β Jul 2024 | Research Intern | Indian Institute of Technology, Dharwad |
| Mar 2024 β Jun 2024 | Computer Vision Intern | Vellore Institute of Technology |
| Sep 2023 β Nov 2023 | Data Science Intern | Quick Heal Technologies |
π AIoT Enhanced Night Vision Object Detection for Vehicle Safety β IEEE NETAPPS 2024
π Optimizing IoT Security: A ML Pipeline for Fast Intrusion Detection β IEEE NETAPPS 2024
π VocalEyes: Vision-Language Models for the Visually Impaired β IEEE ICEI 2024 @ IIT Dharwad
π NoiseFed: Federated Learning for Breast Cancer Detection β Preprint 2024
| Project | Description | Status |
|---|---|---|
| TerraSAM | Agentic VLM pipeline for geospatial segmentation (SAM3 + LLM) | π’ Ongoing |
| Video Restoration | Weather-degraded video restoration β Swin + Restormer (+5% PSNR) | π΅ Research |
| DarkRoad IoT Guardian | Night-vision object detection on Raspberry Pi 5 β IEEE Published | β Complete |
| NetSentry IoT | ML-driven network intrusion detection β 0.952 accuracy β IEEE Published | β Complete |
| VocalEyes | Assistive vision system for the visually impaired using Florence-2 on Jetson | β Complete |
Languages
Frameworks & Libraries
Hardware & Platforms
Certifications