Lists (1)
Sort Name ascending (A-Z)
Starred repositories
OpenMMD is an OpenPose-based application that can convert real-person videos to the motion files (.vmd) which directly implement the 3D model (e.g. Miku, Anmicius) animated movies.
Enjoy the magic of Diffusion models!
[ICLR 2026] RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO, designed for fine-tuning.
[CVPR2026]RAGTrack: Language-aware RGBT Tracking with Retrieval-Augmented Generation
Light Image Video Generation Inference Framework
[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing
Wan: Open and Advanced Large-Scale Video Generative Models
Official implementation of ATI: Any Trajectory Instruction for Controllable Video Generation. https://arxiv.org/pdf/2505.22944
Open-Sora: Democratizing Efficient Video Production for All
HunyuanVideo-1.5: A leading lightweight video generation model
The awesome collection of OpenClaw skills. 5,400+ skills filtered and categorized from the official OpenClaw Skills Registry.🦞
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
ORB-SLAM3: An Accurate Open-Source Library for Visual, Visual-Inertial and Multi-Map SLAM
YOLO26 RKNN Ultralytics 🚀
Decoupled Memory Selection for Multi-target Video Segmentation of SAM3
MOT using deepsort and yolov3 with pytorch
Official code for the AAAI 2026 paper ”Spatio-Temporal Context Learning with Temporal Difference Convolution for Moving Infrared Small Target Detection“
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
Wan: Open and Advanced Large-Scale Video Generative Models
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
yolov8 车牌检测 车牌识别 中文车牌识别 检测 支持12种中文车牌 支持双层车牌
Pytorch Implementation For LPRNet, A High Performance And Lightweight License Plate Recognition Framework.