Stars
PyTorch implementation of "Brain Decodes Deep Nets"
A Google ADK voice agent implementation using Gemini 2.5 Flash with Google Search, Filesystem access and PDF reading capabilities
[Algonauts 2023] PyTorch implementation of "Memory Encoding Model"
[WACV 2024] DREAM: Visual Decoding from Reversing Human Visual System
fMRI-to-image reconstruction on the NSD dataset.
Cross-platform, customizable ML solutions for live and streaming media.
MTCNN face detection implementation for TensorFlow, as a PIP package.
VMamba: Visual State Space Models,code is based on mamba
Code for paper LocalMamba: Visual State Space Model with Windowed Selective Scan
[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding
A demo for mapping class labels from ImageNet to COCO.
Official repository for CVPR24 Precognition Workshop Paper: VMRNN: Integrating Vision Mamba and LSTM for Efficient and Accurate Spatiotemporal Forecasting.
Applicability of scaling laws to vision encoding models
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
This is the official code repository for "MedMamba: Vision Mamba for Medical Image Classification"
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
Implementation of Vision Mamba from the paper: "Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model" It's 2.8x faster than DeiT and saves 86.8% GPU memory wh…
A simple and efficient Mamba implementation in pure PyTorch and MLX.
Implementation of a modular, high-performance, and simplistic mamba for high-speed applications
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.