Stars
Command-line program to download videos from YouTube.com and other video sites
Models and examples built with TensorFlow
The world's simplest facial recognition api for Python and the command line
A collection of design patterns/idioms in Python
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Image augmentation for machine learning experiments.
Hierarchical Reasoning Model Official Release
A collaboration friendly studio for NeRFs
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
Refine high-quality datasets and visual AI models
Deep learning library featuring a higher-level API for TensorFlow.
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
Python library for audio and music analysis
Accessible large language models via k-bit quantization for PyTorch.
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution
Progressive Growing of GANs for Improved Quality, Stability, and Variation
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
Keras implementation of RetinaNet object detection.
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
PyTorch code and models for V-JEPA self-supervised learning from video.