Highlights
- Pro
Lists (4)
Sort Name ascending (A-Z)
Starred repositories
Python interface to the WebRTC Voice Activity Detector
Open-Source Large Vocabulary Continuous Speech Recognition Engine
Keyword spotting on Arm Cortex-M Microcontrollers
Gibson Environments: Real-World Perception for Embodied Agents
PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band)
Dynamic Memory Management for Serving LLMs without PagedAttention
Simultaneous localization and mapping using fiducial markers.
ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations
CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractions