[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Python 1,089 84 Updated Dec 23, 2024

michaelmior / zotero-remarkable

Sync papers from Zotero to a reMarkable tablet

PHP 190 12 Updated Jun 1, 2020

reHackable / awesome-reMarkable

A curated list of projects related to the reMarkable tablet

7,325 250 Updated Mar 4, 2026

google / zimtohrli

Jupyter Notebook 189 12 Updated Nov 3, 2025

Dao-AILab / quack

A Quirky Assortment of CuTe Kernels

Python 896 103 Updated Apr 4, 2026

facebookresearch / PhysicsLM4

Physics of Language Models: Part 4.2, Canon Layers at Scale where Synthetic Pretraining Resonates in Reality

HTML 334 21 Updated Jan 5, 2026

VikramLex / mamba3-minimal

PyTorch implementation of the Mamba-3 architecture

Python 94 10 Updated Mar 18, 2026

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 9,543 1,042 Updated Apr 1, 2026

lifeiteng / naturalspeech3_facodec

FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3

Python 241 22 Updated Apr 20, 2024

facebookresearch / dacvae

DACVAE

Python 208 17 Updated Dec 22, 2025

HazyResearch / Megakernels

Kernels, of the mega variety :)

Python 699 54 Updated Apr 1, 2026

test-time-training / discover

Python 521 68 Updated Mar 30, 2026

helblazer811 / Diffusion-Explorer

Interactive visualizations of the geometric intuition behind diffusion models.

JavaScript 1,084 51 Updated Jan 31, 2026

fla-org / flash-linear-attention

🚀 Efficient implementations for emerging model architectures

Python 4,805 477 Updated Apr 4, 2026

k2-fsa / fast_rnnt

A torch implementation of a recursion which turns out to be useful for RNN-T.

Python 149 22 Updated Aug 25, 2023

openai / circuit_sparsity

Open-source release accompanying Gao et al. 2025

Python 513 55 Updated Dec 11, 2025

sophiawisdom / ssms

GPU kernels for state space models

Python 7 Updated Feb 7, 2023

ESHyperscale / HyperscaleES

Jax Codebase for Evolutionary Strategies at the Hyperscale

Python 259 26 Updated Feb 27, 2026

adamnemecek / traceoid.ai

115 Updated Dec 1, 2024

neuphonic / neutts

On-device TTS model by Neuphonic

Python 5,120 562 Updated Mar 23, 2026

cuemacro / finmarketpy

Python library for backtesting trading strategies & analyzing financial markets (formerly pythalesians)

Python 3,732 524 Updated Mar 10, 2025

facebookresearch / omnilingual-asr

Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages

Python 2,747 246 Updated Dec 30, 2025

MahmoudAshraf97 / ctc-forced-aligner

Text to speech alignment using CTC forced alignment

Python 477 81 Updated Feb 23, 2026

meituan-longcat / LongCat-Audio-Codec

LongCat Audio Tokenizer and Detokenizer

Python 294 22 Updated Apr 2, 2026

SamsungSAILMontreal / TinyRecursiveModels

Python 6,439 1,003 Updated Apr 1, 2026

kyutai-labs / moshi

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 9,956 925 Updated Mar 4, 2026