- Salzburg, Austria
- https://www.linkedin.com/in/markustoman/
Stars
VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling
m-toman / NeMo
Forked from NVIDIA-NeMo/NeMoNeMo: a toolkit for conversational AI
A tokenizer, text cleaner, and phonemizer for many human languages.
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
Clone a voice in 5 seconds to generate arbitrary speech in real-time
2018/2019 TTS framework integrating state of the art open source methods
WaveNet-Vocoder implementation with pytorch.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
DeepMind's Tacotron-2 Tensorflow implementation
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
felipeespic / merlin
Forked from CSTR-Edinburgh/merlinThis is now the official location of the Merlin project.
MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.
Caffe2 is a lightweight, modular, and scalable deep learning framework.
Magenta: Music and Art Generation with Machine Intelligence
header only, dependency-free deep learning framework in C++14
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
This is now the official location of the Merlin project.
A high-quality speech analysis, manipulation and synthesis system
An Open Source Machine Learning Framework for Everyone
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
A list of popular github projects related to deep learning
Mono open source ECMA CLI, C# and .NET implementation.