Skip to content
View robflynnyh's full-sized avatar
🧱
🧱

Highlights

  • Pro

Block or report robflynnyh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A list of all public EEG-datasets

2,963 619 Updated Oct 17, 2025

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 4,551 343 Updated Jun 21, 2025

SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931)

Python 70 4 Updated Dec 23, 2025

[NeurIPS' 25] Benchmark for evaluating TTS models on complex prosodic, expressiveness, and linguistic challenges.

Python 206 15 Updated Dec 9, 2025

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Python 1,089 84 Updated Dec 23, 2024

Sync papers from Zotero to a reMarkable tablet

PHP 190 12 Updated Jun 1, 2020

A curated list of projects related to the reMarkable tablet

7,325 250 Updated Mar 4, 2026
Jupyter Notebook 189 12 Updated Nov 3, 2025

A Quirky Assortment of CuTe Kernels

Python 896 103 Updated Apr 4, 2026

Physics of Language Models: Part 4.2, Canon Layers at Scale where Synthetic Pretraining Resonates in Reality

HTML 334 21 Updated Jan 5, 2026

PyTorch implementation of the Mamba-3 architecture

Python 94 10 Updated Mar 18, 2026

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 9,543 1,042 Updated Apr 1, 2026

FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3

Python 241 22 Updated Apr 20, 2024

DACVAE

Python 208 17 Updated Dec 22, 2025

Kernels, of the mega variety :)

Python 699 54 Updated Apr 1, 2026

Interactive visualizations of the geometric intuition behind diffusion models.

JavaScript 1,084 51 Updated Jan 31, 2026

🚀 Efficient implementations for emerging model architectures

Python 4,805 477 Updated Apr 4, 2026

A torch implementation of a recursion which turns out to be useful for RNN-T.

Python 149 22 Updated Aug 25, 2023

Open-source release accompanying Gao et al. 2025

Python 513 55 Updated Dec 11, 2025

GPU kernels for state space models

Python 7 Updated Feb 7, 2023

Jax Codebase for Evolutionary Strategies at the Hyperscale

Python 259 26 Updated Feb 27, 2026

On-device TTS model by Neuphonic

Python 5,120 562 Updated Mar 23, 2026

Python library for backtesting trading strategies & analyzing financial markets (formerly pythalesians)

Python 3,732 524 Updated Mar 10, 2025

Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages

Python 2,747 246 Updated Dec 30, 2025

Text to speech alignment using CTC forced alignment

Python 477 81 Updated Feb 23, 2026

LongCat Audio Tokenizer and Detokenizer

Python 294 22 Updated Apr 2, 2026

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 9,956 925 Updated Mar 4, 2026
Next