A torch implementation of a recursion which turns out to be useful for RNN-T.

Python 148 22 Updated Aug 25, 2023

Open-source release accompanying Gao et al. 2025

Python 461 47 Updated Dec 11, 2025

GPU kernels for state space models

Python 5 Updated Feb 7, 2023

JAX Codebase for Evolutionary Strategies at the Hyperscale

Python 199 16 Updated Nov 20, 2025

On-device TTS model by Neuphonic

Python 4,280 452 Updated Dec 22, 2025

Python library for backtesting trading strategies & analyzing financial markets (formerly pythalesians)

Python 3,685 521 Updated Mar 10, 2025

Omnilingual ASR: Open-Source Multilingual Speech Recognition for 1600+ Languages

Python 2,495 213 Updated Dec 16, 2025

Text to speech alignment using CTC forced alignment

Python 407 72 Updated Nov 26, 2025

LongCat Audio Tokenizer and Detokenizer

Python 264 18 Updated Dec 15, 2025

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 9,193 832 Updated Nov 20, 2025

A Python package to analyze and compare voices with deep learning

Python 3,187 473 Updated Oct 12, 2023

Official Implementation of GLAP - General Language Audio Pretraining

Python 54 2 Updated Jun 16, 2025

A package for determining the matrix language in bilingual sentences

Python 6 Updated Jun 11, 2025

Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in PyTorch.

Python 87 6 Updated May 25, 2023

ConMamba for Automatic Speech Recognition

Python 100 10 Updated Aug 12, 2024

A PyTorch-based Speech Toolkit

Python 10,956 1,615 Updated Dec 15, 2025

End-to-End Speech Processing Toolkit

Python 9,652 2,363 Updated Dec 16, 2025
Jupyter Notebook 553 43 Updated Jul 10, 2024

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines

Jupyter Notebook 533 70 Updated Aug 27, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 32,037 6,637 Updated Sep 30, 2025

Stick-breaking attention

Python 62 5 Updated Jul 1, 2025

Minimal hackable GRPO implementation

Python 306 42 Updated Jan 31, 2025

Easy-to-use implementation of the ISMIR 2013 Audio Degradation Toolbox

Python 53 10 Updated Nov 19, 2019

A very simple GRPO implementation for reproducing R1-like LLM thinking.

Python 1,509 120 Updated Nov 21, 2025

TorchCFM: a Conditional Flow Matching library

Python 2,185 177 Updated Nov 11, 2025

"pip install unet": PyTorch Implementation of 1D, 2D and 3D U-Net architecture.

Python 184 22 Updated Dec 13, 2024

ASR papers, updated every day

Python 408 20 Updated Dec 22, 2025

Code for "Neural Controlled Differential Equations for Irregular Time Series" (NeurIPS 2020 Spotlight)

Python 691 74 Updated Oct 22, 2022