UNetFormer: A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery, ISPRS. Also, including other vision transformers and CNNs for satellite, aerial image …

Python 1,060 148 Updated Aug 19, 2024

zyw-stu / CPA-Enhancer

This is the official repository of the paper: CPA-Enhancer: Chain-of-Thought Prompted Adaptive Enhancer for Object Detection under Unknown Degradations

Python 50 3 Updated Apr 13, 2024

xfwang23 / RDMNet

[TIV-2025] Implementation for paper "Degradation Modeling for Restoration-enhanced Object Detection in Adverse Weather Scenes".

Python 20 4 Updated Mar 24, 2026

SaFo-Lab / Dolphins

[ECCV 2024] The official code for "Dolphins: Multimodal Language Model for Driving“

Python 89 14 Updated Feb 10, 2025

SysCV / shift-dev

SHIFT Dataset DevKit - CVPR2022

Python 117 10 Updated Jan 8, 2024

juzhengz / LoRI

[COLM 2025] LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation

Python 171 14 Updated Jul 8, 2025

THU-MIG / yoloe

YOLOE: Real-Time Seeing Anything [ICCV 2025]

Python 2,098 198 Updated Jun 26, 2025

sunshangquan / TransMamba

Python 69 4 Updated Sep 11, 2024

ClimBin / RestorMixer

The official implementation of "An Efficient and Mixed Heterogeneous Model for Image Restoration"

Python 55 Updated Sep 9, 2025

ReaFly / Awesome-Vision-Mamba

✨✨Latest Papers on Vision Mamba and Related Areas

381 19 Updated Apr 17, 2025

LMMMEng / OverLoCK

[CVPR 2025 Oral] OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels

Python 524 53 Updated Dec 25, 2025

WangXueyang-uestc / ARConv

Official repo for Adaptive Rectangular Convolution

Jupyter Notebook 183 8 Updated Jun 7, 2025

NVlabs / MambaVision

[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone

Python 2,104 134 Updated Mar 11, 2026

Intellindust-AI-Lab / DEIM

[CVPR 2025] DEIM: DETR with Improved Matching for Fast Convergence

Python 1,474 190 Updated Mar 24, 2026

QwenLM / Qwen2.5-Omni

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,977 323 Updated Jun 12, 2025

sunsmarterjie / yolov12

[NeurIPS 2025] YOLOv12: Attention-Centric Real-Time Object Detectors

Python 2,840 417 Updated Feb 18, 2026

AlphacatPlus / VmambaIR

This is official implementtaion of "VmambaIR: Visual State Space Model for Image Restoration"

Python 223 8 Updated May 7, 2025

XLearning-SCU / 2025-CVPR-MaIR

Pytorch implementation of CVPR 2025 paper, "MaIR: A Locality- and Continuity-Preserving Mamba for Image Restoration". The Code will be released very soon (Within 2 week)

Python 98 6 Updated Sep 12, 2025

DreamerCCC / CutFreq

Python 11 1 Updated Feb 22, 2024

gszfwsb / NCFM

Official PyTorch implementation of the paper "Dataset Distillation with Neural Characteristic Function: A Minmax Perspective" (NCFM) in CVPR 2025 (Full Score, Highlight).

Python 411 31 Updated Dec 20, 2025

sail-sg / lorahub

[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

Python 670 42 Updated Jul 22, 2024

Hhhhhhao / Conv-Adapter

Python 31 1 Updated May 31, 2024

mcpaulgeorge / WalMaFa

[ACCV 2024] Source code of WalMaFa

Python 61 6 Updated Dec 4, 2024

Wan-Video / Wan2.1

Wan: Open and Advanced Large-Scale Video Generative Models

Python 15,770 2,534 Updated Mar 5, 2026

NVIDIA / Cosmos

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

8,096 516 Updated Jan 6, 2026

deep-reinforcement-learning

Jason Li YHWH666

Lists (28)

AI Framework

AI Security

All Weather Restoration

Automatic Vehicle

Conferences

CV

Datasets

Diffusion Model

Domain Adaptation

Embodied AI

Frontier

GNN

Image<->Text

KD

Lifelong Learning

LLM

macOS

masterpiece

Nijigen

Optimization

P&P Modules

RL

Self/Un Supervised

SNN

Tool

USL/FSL

Victims

XAI

Starred repositories

deep-reinforcement-learning