Stars
Open-source IoT Platform - Device management, data collection, processing and visualization.
Toolkit for linearizing PDFs for LLM datasets/training
Unlimited-length talking video generation that supports image-to-video and video-to-video generation
Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.
YOLOv7-based license plate detection and recognition for Chinese license plates; supports double-layer plates and 12 Chinese plate types
LightGlue: Local Feature Matching at Light Speed (ICCV 2023)
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
A curated paper list of awesome skeleton-based action recognition.
[CVPR2024] Code for "SAM-6D: Segment Anything Model Meets Zero-Shot 6D Object Pose Estimation".
[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects
[3DV 2024 Oral] DeDoDe 🎶 Detect, Don't Describe --- Describe, Don't Detect, for Local Feature Matching
Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
A large-scale high-quality human dataset with rich multi-modal annotations
Open-source toolbox for visual fashion analysis based on PyTorch
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in PyTorch
🎥 Python and OpenCV-based scene cut/transition detection program & library.
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
SRS is a simple, high-efficiency, real-time media server supporting RTMP, WebRTC, HLS, HTTP-FLV, HTTP-TS, SRT, MPEG-DASH, and GB28181, with codec support for H.264, H.265, AV1, VP9, AAC, Opus, and …
Rembg is a tool to remove image backgrounds
A new one-shot face swap approach for image and video domains