Stars
Command-line program to download videos from YouTube.com and other video sites
Models and examples built with TensorFlow
The world's simplest facial recognition API for Python and the command line
A collection of design patterns/idioms in Python
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
LaTeX code for making neural network diagrams
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
StableLM: Stability AI Language Models
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Image augmentation for machine learning experiments.
PyTorch code and models for the DINOv2 self-supervised learning method.
Hierarchical Reasoning Model Official Release
A collaboration-friendly studio for NeRFs
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
Refine high-quality datasets and visual AI models
Deep learning library featuring a higher-level API for TensorFlow.
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…
Python library for audio and music analysis
Accessible large language models via k-bit quantization for PyTorch.
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution
Progressive Growing of GANs for Improved Quality, Stability, and Variation