Highlights
- Pro
Stars
[TPAMI 2018] Predicting the Driver’s Focus of Attention: the DR(eye)VE Project. A deep neural network learnt to reproduce the human driver focus of attention (FoA) in a variety of real-world drivin…
This repository contains the code to train and evaluate TRIBE v2, a multimodal model for brain response prediction
Interactive Retrieval Experiment Analysis Dashboard
PyTorch implementation of "Brain Decodes Deep Nets"
A Google ADK voice agent implementation using Gemini 2.5 Flash with Google Search, Filesystem access and PDF reading capabilities
[Algonauts 2023] PyTorch implementation of "Memory Encoding Model"
[WACV 2024] DREAM: Visual Decoding from Reversing Human Visual System
fMRI-to-image reconstruction on the NSD dataset.
MTCNN face detection implementation for TensorFlow, as a PIP package.
VMamba: Visual State Space Models,code is based on mamba
Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan
[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding
A demo for mapping class labels from ImageNet to COCO.
Official repository for CVPR24 Precognition Workshop Paper: VMRNN: Integrating Vision Mamba and LSTM for Efficient and Accurate Spatiotemporal Forecasting.
Applicability of scaling laws to vision encoding models
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
This is the official code repository for "MedMamba: Vision Mamba for Medical Image Classification"
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
Implementation of Vision Mamba from the paper: "Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model" It's 2.8x faster than DeiT and saves 86.8% GPU memory wh…
A simple and efficient Mamba implementation in pure PyTorch and MLX.
Implementation of a modular, high-performance, and simplistic mamba for high-speed applications
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.