Stars
All-in-one training for vision models (YOLO, ViTs, RT-DETR, DINOv3): pretraining, fine-tuning, distillation.
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
Simple static web-based mask drawer, supporting semantic segmentation and video segmentation with interactive Segment Anything Model 2 (SAM2).
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/sp…
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
Official code for "Towards An End-to-End Framework for Flow-Guided Video Inpainting" (CVPR2022)
Effortless AI-assisted data labeling with AI support from YOLO, Segment Anything (SAM+SAM2/2.1+SAM3), MobileSAM!!
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS22).
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Ultralytics YOLOv5 in PyTorch > ONNX > CoreML > TFLite
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Helm Chart of Matrix Alertmanager
COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning
Visualize PyTorch tensors with a single line of code.
youtubevos / MaskTrackRCNN
Forked from open-mmlab/mmdetectionMaskTrackRCNN for video instance segmentation based on mmdetection
OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
A General Toolbox for Identifying Object Detection Errors
BCC - Tools for BPF-based Linux IO analysis, networking, monitoring, and more
Let your viewers become your unlimitedly scalable CDN.