🏠
Working from home

Sponsoring

@Torantulino

tomhitu's starred repositories


Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 14,229 2,104 Updated Mar 16, 2026

Enable Opencode to authenticate against Antigravity (Google's IDE) via OAuth so you can use Antigravity rate limits and access models like gemini-3-pro and claude-opus-4-5-thinking with your Google…

TypeScript 9,727 659 Updated Mar 6, 2026

Neural network 3D visualization framework: build interactive and intuitive models in browsers, with support for pre-trained deep learning models from TensorFlow, Keras, and TensorFlow.js

JavaScript 5,167 446 Updated Dec 5, 2022

The official implementation of HierSpeech++

Python 1,242 149 Updated Feb 20, 2024

The official implementation of GTCRN, an ultra-lightweight SE model.

Python 591 96 Updated Jan 18, 2026

Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching

Python 911 125 Updated Dec 2, 2025

AI-based Audio Watermarking Tool

Python 306 42 Updated Jan 7, 2024

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 36,142 4,038 Updated Apr 19, 2025

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,843 1,387 Updated Dec 6, 2023

Efficient neural speech synthesis

C 1,210 307 Updated Sep 21, 2024

[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers

Python 3,469 243 Updated Dec 21, 2025

Modern audio compression for the internet.

C 3,073 757 Updated Mar 21, 2026

A markdown version emoji cheat sheet

TypeScript 13,648 4,610 Updated Mar 22, 2026

Defect Spectrum: A Granular Look of Large-Scale Defect Datasets with Rich Semantics (ECCV2024)

Python 134 17 Updated Aug 26, 2024

[CVPR 2023] Unofficial PyTorch implementation for CVPR2023 paper, Prototypical Residual Networks for Anomaly Detection and Localization.

Python 38 4 Updated Jul 7, 2023

Diffusion-based singing voice pitch correction

Python 138 20 Updated Sep 20, 2024

GANs with spectral normalization and projection discriminator

Python 1,101 202 Updated Nov 12, 2019

This is an official PyTorch implementation for "MuSc : Zero-Shot Industrial Anomaly Classification and Segmentation with Mutual Scoring of the Unlabeled Images" (MuSc ICLR2024).

Python 432 37 Updated Apr 11, 2024

Python 123 30 Updated Mar 18, 2020

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python 2,332 552 Updated Jul 27, 2024

Audio samples from "HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis"

HTML 10 4 Updated Oct 28, 2020

Code for "Transformers Solve Limited Receptive Field for Monocular Depth Prediction"

Python 175 20 Updated May 12, 2023

PyTorch implementation of the paper "Recurrent Residual Convolutional Neural Network based on U-Net (R2U-Net) for Medical Image Segmentation", applied to the Cityscapes dataset

Python 34 9 Updated Mar 31, 2021

This is an unofficial implementation of Reconstruction by inpainting for visual anomaly detection (RIAD).

Jupyter Notebook 102 21 Updated Nov 12, 2020

Keras documentation, hosted live at keras.io

Jupyter Notebook 2,979 2,117 Updated Mar 17, 2026

FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.

Python 26,398 5,412 Updated Nov 20, 2023

AI Wiki

22 5 Updated Mar 22, 2026

Python 27 6 Updated Jun 28, 2022

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 16,803 1,238 Updated Mar 18, 2026