ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'
-
Updated
Oct 23, 2023 - Python
ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'
A curated list of different papers and datasets in various areas of audio-visual processing
An audio visualizer for React. Provides separate components to visualize both live audio and audio blobs.
Libvisual Audio Visualization
[NeurIPS 2025] OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication
Human Emotion Understanding using multimodal dataset.
Implementation of "EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition, ICCV, 2019" in PyTorch
This repo contains the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation
Browser-based AV signal flow diagram tool for broadcast, live production, and AV integration
Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022)
Source code for "Synchformer: Efficient Synchronization from Sparse Cues" (ICASSP 2024)
🎙 Generator waveform paths for SVG 🎶
Code-first patcher for exploring computation through audio and visual. Connect tools you know and code your own ✨
Transformer-based online speech recognition system with TensorFlow 2
[CVPR 2023] Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception
Audio-Visual Generalized Zero-Shot Learning using Large Pre-Trained Models
[🏆 IJCV 2025 & ACCV 2024 Best Paper Honorable Mention] Official pytorch implementation of the paper "High-Quality Visually-Guided Sound Separation from Diverse Categories"
Repository for BFI National Archive open source preservation workflow scripts
Audio-visual diarization pipeline used for creating VoxConverse dataset
Add a description, image, and links to the audio-visual topic page so that developers can more easily learn about it.
To associate your repository with the audio-visual topic, visit your repo's landing page and select "manage topics."