D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]
-
Updated
Dec 19, 2025 - Python
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]
tangmin's homepage
Rewriting the I3D blender addon from scratch and adding long-sought community features
Sign Language Recognition (I3D + Transformers) on WLASL Dataset - Computer Vision Final Project (CS-GY 6643)
Terraform i3D.net provider
PySlowFast, the official video understanding framework from Facebook AI Research (FAIR), to train, evaluate, and reproduce state-of-the-art video models on the UCF24 action detection dataset. It supports customizable training pipelines, model fine-tuning, and evaluation for video-based action recognition and spatio-temporal localization tasks.
A real-time inferencing of multistreaming YOWOv3(Spatio Temporal Action Detection task) using (UCF101-24) dataset. The repo is extension of https://github.com/Hope1337/YOWOv3, https://arxiv.org/pdf/2408.02623
This project focuses on developing an intelligent security system designed for environments such as malls, hospitals, schools, and offices.
LS2009 repo with unpacked dataS.gar files, scripts, source codes for LS2009Chat, HideHUD and more...
Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and TIMM models.
Providing game servers the ability to communicate with the i3D.net ONE Platform.
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
Add a description, image, and links to the i3d topic page so that developers can more easily learn about it.
To associate your repository with the i3d topic, visit your repo's landing page and select "manage topics."