Skip to content
View KunalChavan245's full-sized avatar
πŸ’­
I may be slow to respond.
πŸ’­
I may be slow to respond.

Block or report KunalChavan245

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Kunalchavan245/README.md

Kunal Chavan

Research Assistant @ MBZUAI Β· Incoming M.S. Student Β· B.Tech ECE, VIT Vellore

Building vision-language models for geospatial understanding

Website LinkedIn Gmail Profile Views


About

I am a researcher working at the intersection of computer vision, vision-language models, and geospatial AI. Currently a Research Assistant at the Mohamed Bin Zayed University of Artificial Intelligence (MBZUAI), where I am building Tera-SAM β€” an agentic segmentation pipeline for zero-shot instance segmentation of satellite imagery using SAM3 and VLMs.

I completed my B.Tech in Electronics and Communication Engineering from VIT Vellore (GPA: 8.29) and am preparing to begin graduate studies. My work spans the full stack from model design to edge deployment on Raspberry Pi and NVIDIA Jetson hardware.

Current focus  β†’  VLMs for geospatial segmentation Β· Agentic AI Β· Remote sensing
Past work      β†’  IoT security Β· Video restoration Β· Edge CV Β· Assistive tech

Research Experience

Period Role Institution
Dec 2025 – Present Research Assistant Mohamed Bin Zayed University of AI (MBZUAI)
May 2025 – Sep 2025 Research Intern Indian Institute of Technology, Guwahati
Jan 2024 – Aug 2024 Research Intern National University of Singapore
Jun 2024 – Jul 2024 Research Intern Indian Institute of Technology, Dharwad
Mar 2024 – Jun 2024 Computer Vision Intern Vellore Institute of Technology
Sep 2023 – Nov 2023 Data Science Intern Quick Heal Technologies

Selected Publications

πŸ“„ AIoT Enhanced Night Vision Object Detection for Vehicle Safety β€” IEEE NETAPPS 2024

πŸ“„ Optimizing IoT Security: A ML Pipeline for Fast Intrusion Detection β€” IEEE NETAPPS 2024

πŸ“„ VocalEyes: Vision-Language Models for the Visually Impaired β€” IEEE ICEI 2024 @ IIT Dharwad

πŸ“„ NoiseFed: Federated Learning for Breast Cancer Detection β€” Preprint 2024


Featured Projects

Project Description Status
TerraSAM Agentic VLM pipeline for geospatial segmentation (SAM3 + LLM) 🟒 Ongoing
Video Restoration Weather-degraded video restoration β€” Swin + Restormer (+5% PSNR) πŸ”΅ Research
DarkRoad IoT Guardian Night-vision object detection on Raspberry Pi 5 β€” IEEE Published βœ… Complete
NetSentry IoT ML-driven network intrusion detection β€” 0.952 accuracy β€” IEEE Published βœ… Complete
VocalEyes Assistive vision system for the visually impaired using Florence-2 on Jetson βœ… Complete

Tech Stack

Languages

Python C++ C SQL Java

Frameworks & Libraries

PyTorch TensorFlow HuggingFace OpenCV scikit-learn Ultralytics

Hardware & Platforms

Raspberry Pi NVIDIA Jetson AWS Linux

Certifications

AWS IBM


GitHub Stats


Open to research collaborations Β· Vision-Language Models Β· Geospatial AI Β· Edge Deployment

Website Β· LinkedIn Β· Email

Pinned Loading

  1. DarkRoad-IoT-Guardian DarkRoad-IoT-Guardian Public

    Jupyter Notebook

  2. NetSentry NetSentry Public

    Jupyter Notebook

  3. Work-at-IIT-G-on-Video-Restoration-of-Weather-Degraded-Videos Work-at-IIT-G-on-Video-Restoration-of-Weather-Degraded-Videos Public

    Python

  4. dotfiles dotfiles Public

    my hyprland files for next time installation

    Shell