- New York
-
05:52
(UTC -05:00) - https://robmsmt.github.io/
- in/robmsmt
- @robmsmt.com
Stars
Robust Speech Recognition via Large-Scale Weak Supervision
A high-throughput and memory-efficient inference and serving engine for LLMs
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
đź’« Industrial-strength Natural Language Processing (NLP) in Python
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Deezer source separation library including pretrained models.
Fast and memory-efficient exact attention
A TTS model capable of generating ultra-realistic dialogue in one pass.
PyTorch implementations of Generative Adversarial Networks.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Tools for merging pretrained large language models.
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Fixes mojibake and other glitches in Unicode text, after the fact.
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Noise supression using deep filtering
An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks
Workaround for Intel throttling issues in Linux.
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Data manipulation and transformation for audio signal processing, powered by PyTorch
AudioLDM: Generate speech, sound effects, music and beyond, with text.