Skip to content
View YHWH666's full-sized avatar

Block or report YHWH666

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 12,805 1,414 Updated Mar 3, 2026

[CVPR25] Official implementation of `MobileMamba: Lightweight Multi-Receptive Visual Mamba Network.'

Python 360 20 Updated Mar 20, 2025

The official implementation of [CVPR 2025] "5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks".

Python 397 19 Updated Jun 23, 2025

[CVPR'25 (Highlight)] Lessons and Insights from a Unifying Study of Parameter-Efficient Fine-Tuning (PEFT) in Visual Recognition

Jupyter Notebook 47 1 Updated Jun 24, 2025

Collection of awesome parameter-efficient fine-tuning resources.

590 19 Updated Dec 10, 2025

UNetFormer: A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery, ISPRS. Also, including other vision transformers and CNNs for satellite, aerial image …

Python 1,060 148 Updated Aug 19, 2024

This is the official repository of the paper: CPA-Enhancer: Chain-of-Thought Prompted Adaptive Enhancer for Object Detection under Unknown Degradations

Python 50 3 Updated Apr 13, 2024

[TIV-2025] Implementation for paper "Degradation Modeling for Restoration-enhanced Object Detection in Adverse Weather Scenes".

Python 20 4 Updated Mar 24, 2026

[ECCV 2024] The official code for "Dolphins: Multimodal Language Model for Driving“

Python 89 14 Updated Feb 10, 2025

SHIFT Dataset DevKit - CVPR2022

Python 117 10 Updated Jan 8, 2024

[COLM 2025] LoRI: Reducing Cross-Task Interference in Multi-Task Low-Rank Adaptation

Python 171 14 Updated Jul 8, 2025

YOLOE: Real-Time Seeing Anything [ICCV 2025]

Python 2,098 198 Updated Jun 26, 2025
Python 69 4 Updated Sep 11, 2024

The official implementation of "An Efficient and Mixed Heterogeneous Model for Image Restoration"

Python 55 Updated Sep 9, 2025

✨✨Latest Papers on Vision Mamba and Related Areas

381 19 Updated Apr 17, 2025

[CVPR 2025 Oral] OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels

Python 524 53 Updated Dec 25, 2025

Official repo for Adaptive Rectangular Convolution

Jupyter Notebook 183 8 Updated Jun 7, 2025

[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone

Python 2,104 134 Updated Mar 11, 2026

[CVPR 2025] DEIM: DETR with Improved Matching for Fast Convergence

Python 1,474 190 Updated Mar 24, 2026

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,977 323 Updated Jun 12, 2025

[NeurIPS 2025] YOLOv12: Attention-Centric Real-Time Object Detectors

Python 2,840 417 Updated Feb 18, 2026

This is official implementtaion of "VmambaIR: Visual State Space Model for Image Restoration"

Python 223 8 Updated May 7, 2025

Pytorch implementation of CVPR 2025 paper, "MaIR: A Locality- and Continuity-Preserving Mamba for Image Restoration". The Code will be released very soon (Within 2 week)

Python 98 6 Updated Sep 12, 2025
Python 11 1 Updated Feb 22, 2024

Official PyTorch implementation of the paper "Dataset Distillation with Neural Characteristic Function: A Minmax Perspective" (NCFM) in CVPR 2025 (Full Score, Highlight).

Python 411 31 Updated Dec 20, 2025

[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

Python 670 42 Updated Jul 22, 2024
Python 31 1 Updated May 31, 2024

[ACCV 2024] Source code of WalMaFa

Python 61 6 Updated Dec 4, 2024

Wan: Open and Advanced Large-Scale Video Generative Models

Python 15,770 2,534 Updated Mar 5, 2026

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

8,096 516 Updated Jan 6, 2026
Next