robmsmt

Follow

Rob robmsmt

Follow

62 followers · 205 following

New York
05:52 (UTC -05:00)
https://robmsmt.github.io/
in/robmsmt
@robmsmt.com

Achievements

Achievements

Stars

69 stars written in Python

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 90,713 11,375 Updated Sep 8, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,848 11,225 Updated Nov 12, 2025

deepspeedai / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,674 4,622 Updated Nov 11, 2025

jax-ml / jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 33,947 3,243 Updated Nov 12, 2025

explosion / spaCy

💫 Industrial-strength Natural Language Processing (NLP) in Python

Python 32,803 4,621 Updated Nov 10, 2025

facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 31,940 6,626 Updated Sep 30, 2025

deezer / spleeter

Deezer source separation library including pretrained models.

Python 27,747 3,048 Updated Apr 2, 2025

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 20,463 2,130 Updated Nov 12, 2025

nari-labs / dia

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 18,800 1,633 Updated Jul 6, 2025

eriklindernoren / PyTorch-GAN

PyTorch implementations of Generative Adversarial Networks.

Python 17,325 4,097 Updated Jun 18, 2024

NVIDIA-NeMo / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,083 3,193 Updated Nov 11, 2025

iterative / dvc

🦉 Data Versioning and ML Experiments

Python 15,080 1,255 Updated Nov 11, 2025

PaddlePaddle / PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 12,347 1,947 Updated Oct 20, 2025

Miserlou / Zappa

Serverless Python

Python 11,865 1,187 Updated Mar 23, 2023

cleanlab / cleanlab

Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Python 11,082 874 Updated Nov 10, 2025

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Python 10,772 1,598 Updated Nov 7, 2025

cython / cython

The most widely used Python to C compiler

Python 10,436 1,596 Updated Nov 12, 2025

espnet / espnet

End-to-End Speech Processing Toolkit

Python 9,577 2,346 Updated Nov 12, 2025

EleutherAI / gpt-neo

An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

Python 8,290 963 Updated Feb 25, 2022

EleutherAI / gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 7,333 1,093 Updated Sep 26, 2025

arcee-ai / mergekit

Tools for merging pretrained large language models.

Python 6,443 630 Updated Oct 31, 2025

yl4579 / StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 6,051 631 Updated Aug 10, 2024

rspeer / python-ftfy

Fixes mojibake and other glitches in Unicode text, after the fact.

Python 3,983 123 Updated Oct 30, 2024

predibase / lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Python 3,530 283 Updated May 21, 2025

Rikorose / DeepFilterNet

Noise supression using deep filtering

Python 3,497 347 Updated Oct 17, 2024

quantopian / qgrid

An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks

Python 3,082 427 Updated Jan 12, 2024

erpalma / throttled

Workaround for Intel throttling issues in Linux.

Python 2,845 167 Updated Jan 22, 2025

zzw922cn / Automatic_Speech_Recognition

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

Python 2,842 533 Updated Mar 24, 2023

pytorch / audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

Python 2,772 739 Updated Nov 11, 2025

haoheliu / AudioLDM

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Python 2,765 248 Updated Jun 25, 2025