Skip to content
View m-toman's full-sized avatar

Block or report m-toman

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling

Python 490 15 Updated Nov 18, 2025

NeMo: a toolkit for conversational AI

Jupyter Notebook 1 Updated Apr 29, 2022

A tokenizer, text cleaner, and phonemizer for many human languages.

Python 329 46 Updated Nov 15, 2024

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python 2,286 544 Updated Jul 27, 2024

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Jupyter Notebook 1,632 349 Updated Apr 22, 2024

GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis

Python 1,030 213 Updated Aug 28, 2023

The Julia Programming Language

Julia 48,126 5,693 Updated Dec 22, 2025

Swift for TensorFlow

Jupyter Notebook 6,146 614 Updated Jan 12, 2022

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 59,052 9,396 Updated Dec 15, 2025

2018/2019 TTS framework integrating state of the art open source methods

Jupyter Notebook 48 4 Updated Jul 8, 2019

x86-64 assembler embedded in Python

Python 2,041 168 Updated Sep 25, 2023

WaveNet-Vocoder implementation with pytorch.

Shell 300 58 Updated Jun 8, 2020

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 96,076 26,328 Updated Dec 22, 2025

DeepMind's Tacotron-2 Tensorflow implementation

Python 2,319 906 Updated Jul 6, 2023

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Jupyter Notebook 10,087 1,326 Updated Nov 9, 2023

Edinburgh Speech Tools

C++ 61 28 Updated Jun 10, 2023

This is now the official location of the Merlin project.

Python 1 1 Updated Apr 26, 2018

MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.

Python 80 31 Updated Oct 14, 2019
C++ 408 95 Updated Nov 30, 2021

Caffe2 is a lightweight, modular, and scalable deep learning framework.

Shell 8,400 1,922 Updated Feb 7, 2023

Magenta: Music and Art Generation with Machine Intelligence

Python 19,759 3,806 Updated Jul 8, 2025

header only, dependency-free deep learning framework in C++14

C++ 6,011 1,399 Updated Apr 17, 2022

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

C++ 23,514 5,911 Updated Dec 22, 2025

This is now the official location of the Merlin project.

Python 1,320 437 Updated Mar 3, 2020

A high-quality speech analysis, manipulation and synthesis system

C++ 1,286 261 Updated Feb 21, 2025

Open Source Computer Vision Library

C++ 85,394 56,421 Updated Dec 22, 2025

An Open Source Machine Learning Framework for Everyone

C++ 192,907 75,187 Updated Dec 22, 2025

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

C++ 20,826 6,745 Updated Oct 25, 2023

A list of popular github projects related to deep learning

Python 6,096 1,225 Updated Feb 16, 2024

Mono open source ECMA CLI, C# and .NET implementation.

C# 11,410 3,837 Updated Aug 27, 2024
Next