Skip to content
View jcvasquezc's full-sized avatar

Block or report jcvasquezc

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

OpenS2S : Advancing Fully Open-Source End-to-End Empathetic Large Speech Language Model

Python 118 16 Updated Mar 28, 2026

[EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation

Python 150 5 Updated May 18, 2025

"Cyberpunk style" for matplotlib plots

Python 1,836 78 Updated Aug 6, 2025

Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS

Python 957 142 Updated Oct 2, 2024

Whisper realtime streaming for long speech-to-text transcription and translation

Python 3,604 412 Updated Nov 12, 2025

Generating Talking Face Landmarks from Speech

Python 159 44 Updated Dec 22, 2022

Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.

Python 361 89 Updated May 23, 2023

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

Jupyter Notebook 4,688 414 Updated Apr 3, 2024

GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

Python 2,663 296 Updated Oct 18, 2024

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Python 2,805 210 Updated Sep 9, 2025

Flower: A Friendly Federated AI Framework

Python 6,858 1,179 Updated Apr 27, 2026

Source code for LCN submission for ADReSS-M challenge (formerly called MADReSS).

Python 15 5 Updated Jun 1, 2023

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 22,107 2,696 Updated Jan 23, 2026

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 21,530 2,238 Updated Apr 4, 2026

😸 💬 A module to compute textual lexical richness (aka lexical diversity).

Python 112 22 Updated Aug 27, 2023

A Python wrapper for the high-quality vocoder "World"

Cython 786 126 Updated Jan 21, 2025
Jupyter Notebook 1,028 226 Updated Mar 20, 2024

This is a pytorch implementation of the paper: StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks

Python 523 92 Updated Oct 11, 2019

An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.

Python 2,911 491 Updated Mar 31, 2023

Single shot neural network pruning before training the model, based on connection sensitivity

Jupyter Notebook 11 2 Updated Aug 7, 2019

Low-level Python library used to interact with a Substra network

Python 279 35 Updated Oct 14, 2024

Papr Readr Bot

Python 7 Updated Nov 29, 2024

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 8,909 766 Updated Mar 26, 2026

an editor for spoken-word audio with automatic transcription

TypeScript 1,839 57 Updated Jan 6, 2026

Robust Speech Recognition via Large-Scale Weak Supervision

Python 98,503 12,109 Updated Apr 15, 2026

A collection of utilities for handling IPA phones.

Python 27 2 Updated Sep 24, 2023

Automatic classification of the Big-Five personality traits from texts using embeddings and Long short-term memory network.

Jupyter Notebook 1 Updated Jun 9, 2020

Dr.VOT is an a software package for automatic measurement of voice onset time (VOT).

Python 32 10 Updated Jul 25, 2023

add statistical significance annotations on seaborn plots. Further development of statannot, with bugfixes, new features, and a different API.

Python 837 82 Updated Jan 8, 2026
Next