Stars
Linux 平台下基于 Rust + GTK 开发的网易云音乐播放器
HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model
[AAAI'26 Oral] DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping
This is a Next.js, Tailwind CSS blogging starter template. Comes out of the box configured with the latest technologies to make technical writing a breeze. Easily configurable and customizable. Per…
A clean and beautiful Hugo theme, which built using Tailwind CSS.
CoTracker is a model for tracking any point (pixel) on a video.
ICRA 2022 "Hybrid Physical Metric For 6-DoF Grasp Pose Detection"
[CVPR 2025 Best Paper Nomination] FoundationStereo: Zero-Shot Stereo Matching
[CVPR 2025] Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
Text-audio foundation model from Boson AI
[CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model"
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
Official code for TimeCraft: A Time Series Generation Framework for Real-World Applications
This is the official code of VideoAgent: A Memory-augmented Multimodal Agent for Video Understanding (ECCV 2024)
Using OnnxRuntime to inference yolov10,yolov10+SAM ,yolov10+bytetrack , SAM2 and paddleOCR by c++ .
woct0rdho / triton-windows
Forked from triton-lang/tritonFork of the Triton language and compiler for Windows support and easy installation
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image …
Enjoy the magic of Diffusion models!
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
Dead simple FLUX LoRA training UI with LOW VRAM support
[Not Official] Implementation of TC-Resnet, INTERSPEECH 2019
[TITS 2024] You Only Look Clusters for Tiny Object Detection in Aerial Images