Stars
Posts related to my YouTube videos
ROS wrapper for DM-VIO: Delayed Marginalization Visual-Inertial Odometry
An open source platform for visual-inertial navigation research.
This is the PyTorch implementation of our paper "RSBuilding: Towards General Remote Sensing Image Building Extraction and Change Detection with Foundation Model"
FFA-Net: Feature Fusion Attention Network for Single Image Dehazing
[IEEE TGRS 2024] ChangeMamba: Remote Sensing Change Detection Based on Spatio-Temporal State Space Model
Easy-to-use image segmentation library with an awesome pre-trained model zoo, supporting a wide range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image …
Self-study on Larry Wasserman's "All of Statistics"
GeoAI: Artificial Intelligence for Geospatial Data
Famous Vision Language Models and Their Architectures
LaVIT: Empower the Large Language Model to Understand and Generate Visual Content
Efficient Multimodal Large Language Models: A Survey
This repository contains code for fine-tuning the LLAVA-1.6-7b-mistral (Multimodal LLM) model.
[2025] Efficient Vision Language Models: A Survey
A simple way to calibrate your neural network.
A small example of how to extract SVO (subject, verb, object) information from an input, and how to detect whether that input was a question.
Faithfulness and factuality annotations of XSum summaries from our paper "On Faithfulness and Factuality in Abstractive Summarization" (https://www.aclweb.org/anthology/2020.acl-main.173.pdf).
Official PyTorch implementation of StyleGAN3
In this notebook, I show that SIFT is robust, but not invariant, to changes in the camera's perspective view
In this project, a panorama of a simple video is created from scratch using Python and OpenCV
An introduction to the world of Augmented Reality