WeberJulian


Training code and dataset cleansing with Sidon

Python 65 8 Updated Nov 11, 2025
Python 162 10 Updated Oct 15, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 18,984 1,651 Updated Nov 19, 2025

🚀 Efficient implementations of state-of-the-art linear attention models

Python 4,089 333 Updated Dec 20, 2025

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 41,018 3,201 Updated Dec 19, 2025

The Modular Platform (includes MAX & Mojo)

Mojo 25,368 2,742 Updated Dec 20, 2025

Grok open release

Python 50,573 8,373 Updated Aug 30, 2024
JavaScript 175 19 Updated Dec 1, 2023

Faster Whisper transcription with CTranslate2

Python 19,556 1,631 Updated Nov 19, 2025

LLM inference in C/C++

C++ 91,661 14,169 Updated Dec 21, 2025

French instruction-following and chat models

Jupyter Notebook 506 49 Updated Dec 5, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 92,185 11,548 Updated Dec 15, 2025

Avocodo: Generative Adversarial Network for Artifact-free Vocoder

Python 122 15 Updated Jul 14, 2022

A walkthrough of transformer architecture code

Jupyter Notebook 371 63 Updated Feb 20, 2024

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

Python 586 157 Updated Aug 19, 2023

A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.

Python 2,194 205 Updated Sep 26, 2025

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

C++ 2,544 297 Updated Mar 11, 2024

A tokenizer, text cleaner, and phonemizer for many human languages.

Python 329 46 Updated Nov 15, 2024

Text-to-Speech in JavaScript using eSpeak

C++ 1,323 297 Updated Jan 30, 2020

[Does not work anymore!] Script to enable systemd support on current Ubuntu WSL2 images

Shell 1,576 394 Updated Sep 17, 2023

🐸 - A general purpose model trainer, as flexible as it gets

Python 230 144 Updated Mar 7, 2024

Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2

115 21 Updated May 20, 2019

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

Jupyter Notebook 1,042 97 Updated Nov 4, 2024

An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.

Python 80 18 Updated May 20, 2023

Simple but maybe too simple config management through Python data classes. We use it for machine learning.

Python 107 37 Updated Apr 12, 2023

This repository contains the source code for the paper First Order Motion Model for Image Animation

Jupyter Notebook 14,981 3,286 Updated Nov 14, 2024

TensorFlow port of first-order motion model. TF Lite and TF.js compatible, supports the original's checkpoints and implements in-graph kp processing, but inference only (no training).

Python 35 9 Updated Jun 22, 2021

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Python 842 158 Updated Oct 10, 2023

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 43,946 5,857 Updated Aug 16, 2024