-
Hugging Face
- France
- https://ebezzam.github.io/
- @ericbezzam
- in/eric-bezzam
Highlights
- Pro
Lists (4)
Sort Name ascending (A-Z)
Stars
Mount Hugging Face Buckets and repos as local filesystems. No download, no copy, no waiting.
VoiceBench: Benchmarking LLM-Based Voice Assistants
[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling
[NeurIPS' 25] Benchmark for evaluating TTS models on complex prosodic, expressiveness, and linguistic challenges.
Official Implementation of "ALARM: Audio–Language Alignment for Reasoning Models"
Benchmarking Large Language Models using the Eleusis card game
Real-time text-to-speech with Qwen3-TTS
The most powerful local music generation model that outperforms most commercial alternatives, supporting Mac, AMD, Intel, and CUDA devices.
Riva Python client API and CLI utils
Soprano: Instant, Ultra-Realistic Text-to-Speech
Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
The Hugging Face Course on Transformers for Audio
Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages
SoulX-Podcast is an inference codebase by the Soul AI team for generating high-fidelity podcasts from text.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
Whisper realtime streaming for long speech-to-text transcription and translation