Stars
[CVPR 2025] official implementation of “Exploring Intrinsic Normal Prototypes within a Single Image for Universal Anomaly Detection”
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Academic Research Skills for Claude Code: research → write → review → revise → finalize
It's an app that lets you interact with a virtual pet on your desktop.
OpenMMLab YOLO series toolbox and benchmark. Implemented RTMDet, RTMDet-Rotated,YOLOv5, YOLOv6, YOLOv7, YOLOv8,YOLOX, PPYOLOE, etc.
Batch download and stitch Google Street View tiles into panoramas using Maps Tile API
[ESSD 2025 & IEEE DFC 2025 & CVPRW 2026] Bright: A globally distributed multimodal VHR dataset for all-weather disaster response
PyTorch implementation of JiT https://arxiv.org/abs/2511.13720
The official Pytorch implementation of OCNet, OCRNet, and SegFix.
This repository contains the solution for the 1st Mars Landslide Segmentation Challenge. It implements M3LSNet (Mamba-based Multimodal Martian Landslide Segmentation Network).
This project is for NTIRE Workshop and Challenges @ CVPR 2025 on Day and Night Raindrop Removal for Dual-Focused Images
This is the repo of team UIT-SHANKS in NTIRE 2025 The First Challenge on Day and Night Raindrop Removal for Dual-Focused Images
A collaboration friendly studio for NeRFs
implementations and experimentation on mHC by deepseek - https://arxiv.org/abs/2512.24880
[ECCV2024] "Raindrop Clarity: A Dual-Focused Dataset for Day and Night Raindrop Removal", https://arxiv.org/abs/2407.16957
open Multi-View Stereo reconstruction library
[CVPR 2026 Highlight] SegEarth-R2: Towards Comprehensive Language-guided Segmentation for Remote Sensing Images
📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程
A Simple, Lightweight, and Extensible Serving Framework for X-AnyLabeling
🔥 💪 Crack-Detection-and-Segmentation-Dataset-for-UAV-Inspection
YOLOv11-RGBT: Towards a Comprehensive Single-Stage Multispectral Object Detection Framework(Supports RGBT detection for all YOLO series from YOLOv3 to YOLOv13, as well as RTDETR. 【Ultralytics YOLOv…
A resource collection of RGBT Salient Object Detection
Full python interactive 3D Gaussian Splatting viewer for real-time editing and analyzing.
We write your reusable computer vision tools. 💜
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
CUBIT-Det: High-resolution Infrastructure Defect Detection Dataset