This repository contains the official implementation of FastVLM
Refer and Ground Anything Anywhere at Any Granularity
Self-supervised visual learning using momentum contrast in PyTorch
Towards Real-World Vision-Language Understanding
Official implementation of Watermark Anything with Localized Messages
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
Main repository for Vispy
Benchmarking Multimodal Agents for Open-Ended Tasks
Official code for Style Aligned Image Generation via Shared Attention
Gemma open-weight LLM library, from Google DeepMind
Guiding Instruction-based Image Editing via Multimodal Large Language
TorchMultimodal is a PyTorch library
ICLR 2024 Spotlight: curation/training code, metadata, distribution
PyTorch3D is FAIR's library of reusable components for deep learning
[CVPR 2025 Best Paper Award] VGGT
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences
24 Lessons, 12 Weeks, Get Started as a Web Developer
The book "Performance Analysis and Tuning on Modern CPUs"
Django + shell_plus + Jupyter notebooks made easy
3D plotting and mesh analysis through a streamlined interface
A theme for Sublime Text 3 by Mattia Astorino
Python module that helps you build complex pipelines of batch jobs
Consistency Distilled Diff VAE
Task-oriented finetuning for better embeddings on neural search
Visual localization made easy with hloc