⚡️ Accelerate visual processing with FastVGGT, a training-free method that boosts the efficiency of Visual Geometry Transformers.
-
Updated
Sep 5, 2025 - Python
⚡️ Accelerate visual processing with FastVGGT, a training-free method that boosts the efficiency of Visual Geometry Transformers.
Implementation of the paper "SoftHGNN: Soft Hypergraph Neural Networks for General Visual Recognition".
Implementation of "YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual Perception".
[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone
[ICLR 2024] Official PyTorch implementation of FasterViT: Fast Vision Transformers with Hierarchical Attention
Official Implementation of "Fine-Tuning is Fine, if Calibrated.", NeurIPS 2024
Official PyTorch Implementation of Mamba2D: A Natively Multi-Dimensional State-Space Model for Vision Tasks
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
This contains the codes for VR-Assignment1 - Coin Detection and Counting and Image Stitching
Build Change - Post-Disaster Rapid Response Retrofit. Following Build Change's main premise to Build Disaster Resistant Buildings and Change Construction Practices Permanently, PD3R Team's main objective is to improve the safety conditions of buildings and reduce human and economic loss after the occurrence of a natural disaster.
Data repository for "Signatures of the uncanny valley effect in an artificial neural network", Computers in Human Behavior, 2023
GPT4Vis: What Can GPT-4 Do for Zero-shot Visual Recognition?
[ICML 2023] Official PyTorch implementation of Global Context Vision Transformers
This repository contains the ViewFool and ImageNet-V proposed by the paper “ViewFool: Evaluating the Robustness of Visual Recognition to Adversarial Viewpoints” (NeurIPS2022).
Multimodal Prompting with Missing Modalities for Visual Recognition, CVPR'23
Improving Generalization via Scalable Neighborhood Component Analysis
A cute table-robot that helps you with your daily tasks
[ICLR 2023 Spotlight] GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation
Official PyTorch implementation of Fully Attentional Networks
Add a description, image, and links to the visual-recognition topic page so that developers can more easily learn about it.
To associate your repository with the visual-recognition topic, visit your repo's landing page and select "manage topics."