-
RØDE microphones
- https://www.robots.ox.ac.uk/~jaesung/
- @huh_jaesung
-
ca-subtitle Public
Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"
-
look-listen-recognise Public
Dataset page for Look, Listen and Recognise : character-aware audio-visual subtitling (ICASSP 2024)
-
simple-subtitling Public
Character-aware audio-only subtitling
-
SimpleDiarization Public
Simple diarization model
-
av-diarization Public
Audio-visual diarization pipeline used for creating VoxConverse dataset
-
voice-gender-classifier Public
Voice gender classifier using ECAPA-TDNN
-
webpage_laurynas Public
Forked from karazijal/karazijal.github.ioAuto Generated (should be)
HTML UpdatedJan 17, 2025 -
ECAPA-TDNN Public
Forked from TaoRuijie/ECAPA-TDNNUnofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
Python MIT License UpdatedApr 11, 2024 -
VoxMovies Public
Evaluation script for VoxMovies dataset in PyTorch
-
-
EasyComDataset Public
Forked from facebookresearch/EasyComDatasetThe Easy Communications (EasyCom) dataset is a world-first dataset designed to help mitigate the *cocktail party effect* from an augmented-reality (AR) -motivated multi-sensor egocentric world view.
Other UpdatedJul 28, 2023 -
avobjects Public
Forked from afourast/avobjectsImplementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"
Python MIT License UpdatedJun 30, 2023 -
-
TalkNet-ASD Public
Forked from TaoRuijie/TalkNet-ASDACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'
Python MIT License UpdatedMay 29, 2023 -
jaesunghuh.github.io Public
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
JavaScript MIT License UpdatedMay 26, 2023 -
ego_actrecog_analysis Public
Forked from beasteers/ego_actrecog_analysisPython Other UpdatedFeb 25, 2023 -
-
laughter-detection Public
Forked from jrgillick/laughter-detectionPython MIT License UpdatedJun 15, 2022 -
VoxSRC2021 Public
Forked from a-nagrani/VoxSRC2020Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2021
-
voxceleb_trainer Public
Forked from clovaai/voxceleb_trainerIn defence of metric learning for speaker recognition
Python MIT License UpdatedDec 31, 2020 -
SlowFast Public
Forked from fanyix/SlowFastPySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
Python Apache License 2.0 UpdatedAug 11, 2020