    Repositories list

    • SpikeTAD

      Public
      SpikeTAD: Spiking Neural Networks for End-to-End Temporal Action Detection
      Python
      MIT License
      0000 Updated Mar 30, 2026
    • Python
      1200 Updated Mar 9, 2026
    • LongVPO

      Public
      [NeurIPS 2025] LongVPO: From Anchored Cues to Self-Reasoning for Long-Form Video Preference Optimization
      Python
      0500 Updated Mar 7, 2026
    • Video-o3

      Public
      Video-o3: Native Interleaved Clue Seeking for Long Video Multi-Hop Reasoning
      Python
      513740 Updated Mar 6, 2026
    • AMD

      Public
      [CVPR 2024] Asymmetric Masked Distillation for Pre-Training Small Foundation Models
      Python
      11800 Updated Jan 11, 2026
    • SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation
      Python
      Apache License 2.0
      3760380 Updated Dec 23, 2025
    • UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions
      Python
      34820 Updated Dec 16, 2025
    • SAM2-Plus

      Public
      SAM 2++: Tracking Anything at Any Granularity
      Python
      Apache License 2.0
      55900 Updated Dec 15, 2025
    • UniAVGen

      Public
      HTML
      0400 Updated Dec 14, 2025
    • [ICCV 2025] MobileViCLIP: An Efficient Video-Text Model for Mobile Devices
      Python
      MIT License
      11840 Updated Dec 11, 2025
    • PixNerd

      Public
      [ICLR 2026] PixNerd: Pixel Neural Field Diffusion
      Python
      MIT License
      617550 Updated Dec 10, 2025
    • FlowBack

      Public
      [AAAI 2026] Flowing Backwards: Improving Normalizing Flows via Reverse Representation Alignment
      Python
      Apache License 2.0
      01610 Updated Dec 9, 2025
    • RGE

      Public
      Reasoning Guided Embeddings: Leveraging MLLM Reasoning for Improved Multimodal Retrieval
      Python
      01310 Updated Nov 29, 2025
    • JavaScript
      0200 Updated Nov 25, 2025
    • [NeurIPS 2025 Spotlight] StreamForest: Efficient Online Video Understanding with Persistent Event Memory
      Python
      Apache License 2.0
      514660 Updated Nov 4, 2025
    • [TPAMI] JointFormer: A Unified Framework with Joint Modeling for Video Object Segmentation
      Python
      01200 Updated Oct 21, 2025
    • MeMOTR

      Public
      [ICCV 2023] MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking
      Python
      MIT License
      2022340 Updated Oct 15, 2025
    • MotionRAG

      Public
      [NeurIPS 2025] MotionRAG: Motion Retrieval-Augmented Image-to-Video Generation
      Python
      MIT License
      52430 Updated Oct 9, 2025
    • [CVPR 2025] Online Video Understanding: OVBench and VideoChat-Online
      Python
      593100 Updated Oct 7, 2025
    • JavaScript
      0000 Updated Oct 2, 2025
    • CycleACR

      Public
      [TPAMI-2025] CycleACR: Cycle Modeling of Actor-Context Relations for Video Action Detection
      Python
      Apache License 2.0
      0300 Updated Sep 11, 2025
    • DDT

      Public
      [CVPR 2026] DDT: Decoupled Diffusion Transformer
      Python
      1938050 Updated Aug 22, 2025
    • MOTIP

      Public
      [CVPR 2025] Multiple Object Tracking as ID Prediction
      Python
      Apache License 2.0
      42500100 Updated Aug 20, 2025
    • VideoEval

      Public
      VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model
      Python
      01500 Updated Jul 31, 2025
    • Video-DC

      Public
      Python
      Apache License 2.0
      11110 Updated Jul 30, 2025
    • CaReBench

      Public
      A Fine-grained Benchmark for Video Captioning and Retrieval
      Python
      22740 Updated Jul 16, 2025
    • [ICML 2025] Differentiable Solver Search for Fast Diffusion Sampling
      Python
      MIT License
      02110 Updated Jul 7, 2025
    • p-MoD

      Public
      [ICCV 2025] p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay
      Python
      Apache License 2.0
      24310 Updated Jun 26, 2025
    • DEQDet

      Public
      [ICCV 2023] Deep Equilibrium Object Detection
      Jupyter Notebook
      12710 Updated Jun 18, 2025
    • SORCE

      Public
      Small Object Retrieval in Complex Environments (SORCE)
      Python
      1500 Updated Jun 2, 2025