Skip to content
View m-toman's full-sized avatar

Block or report m-toman

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An Open Source Machine Learning Framework for Everyone

C++ 194,918 75,286 Updated Apr 28, 2026

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 99,493 27,608 Updated Apr 28, 2026

Open Source Computer Vision Library

C++ 87,264 56,544 Updated Apr 28, 2026

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

C++ 23,863 5,988 Updated Apr 28, 2026

The Julia Programming Language

Julia 48,628 5,767 Updated Apr 27, 2026

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 59,644 9,405 Updated Mar 9, 2026

A high-quality speech analysis, manipulation and synthesis system

C++ 1,312 264 Updated Feb 18, 2026

Magenta: Music and Art Generation with Machine Intelligence

Python 19,777 3,779 Updated Jan 6, 2026

[ICLR2026] VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling

Python 519 19 Updated Nov 18, 2025

kaldi-asr/kaldi is the official location of the Kaldi project.

Shell 15,379 5,361 Updated Sep 22, 2025

A tokenizer, text cleaner, and phonemizer for many human languages.

Python 331 45 Updated Nov 15, 2024

Mono open source ECMA CLI, C# and .NET implementation.

C# 11,437 3,817 Updated Aug 27, 2024

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python 2,342 553 Updated Jul 27, 2024

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Jupyter Notebook 1,641 351 Updated Apr 22, 2024

A list of popular github projects related to deep learning

Python 6,134 1,232 Updated Feb 16, 2024

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Jupyter Notebook 10,138 1,325 Updated Nov 9, 2023

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

C++ 20,811 6,711 Updated Oct 25, 2023

x86-64 assembler embedded in Python

Python 2,052 167 Updated Sep 25, 2023

GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis

Python 1,040 213 Updated Aug 28, 2023

DeepMind's Tacotron-2 Tensorflow implementation

Python 2,319 904 Updated Jul 6, 2023

Edinburgh Speech Tools

C++ 63 29 Updated Jun 10, 2023

Caffe2 is a lightweight, modular, and scalable deep learning framework.

Shell 8,385 1,909 Updated Feb 7, 2023

NeMo: a toolkit for conversational AI

Jupyter Notebook 1 Updated Apr 29, 2022

header only, dependency-free deep learning framework in C++14

C++ 6,022 1,393 Updated Apr 17, 2022

Swift for TensorFlow

Jupyter Notebook 6,135 611 Updated Jan 12, 2022
C++ 410 96 Updated Nov 30, 2021

Frontend system for HMM-based speech synthesis models generated by HTS.

C 40 7 Updated Apr 5, 2021

WaveNet-Vocoder implementation with pytorch.

Shell 300 59 Updated Jun 8, 2020

This is now the official location of the Merlin project.

Python 1,321 432 Updated Mar 3, 2020

MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.

Python 80 31 Updated Oct 14, 2019
Next