tomhitu

🏠

Working from home

TomHitU tomhitu

🏠

Working from home

Ph.D. in NLP Master in Software Engineer Bachelor in IoT

8 followers · 9 following

tomhitu

Sponsoring

Achievements

Starred repositories

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 14,229 2,104 Updated Mar 16, 2026

NoeFabris / opencode-antigravity-auth

Enable Opencode to authenticate against Antigravity (Google's IDE) via OAuth so you can use Antigravity rate limits and access models like gemini-3-pro and claude-opus-4-5-thinking with your Google…

TypeScript 9,727 659 Updated Mar 6, 2026

tensorspace-team / tensorspace

Neural network 3D visualization framework, build interactive and intuitive model in browsers, support pre-trained deep learning models from TensorFlow, Keras, TensorFlow.js

JavaScript 5,167 446 Updated Dec 5, 2022

sh-lee-prml / HierSpeechpp

The official implementation of HierSpeech++

Python 1,242 149 Updated Feb 20, 2024

Xiaobin-Rong / gtcrn

The official implementation of GTCRN, an ultra-lightweight SE model.

Python 591 96 Updated Jan 18, 2026

k2-fsa / ZipVoice

Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching

Python 911 125 Updated Dec 2, 2025

wavmark / wavmark

AI-based Audio Watermarking Tool

Python 306 42 Updated Jan 7, 2024

myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 36,142 4,038 Updated Apr 19, 2025

jaywalnut310 / vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,843 1,387 Updated Dec 6, 2023

xiph / LPCNet

Efficient neural speech synthesis

C 1,210 307 Updated Sep 21, 2024

Paper2Poster / Paper2Poster

[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers

Python 3,469 243 Updated Dec 21, 2025

xiph / opus

Modern audio compression for the internet.

C 3,073 757 Updated Mar 21, 2026

ikatyang / emoji-cheat-sheet

A markdown version emoji cheat sheet

TypeScript 13,648 4,610 Updated Mar 22, 2026

EnVision-Research / Defect_Spectrum

Defect Spectrum: A Granular Look of Large-Scale Defect Datasets with Rich Semantics (ECCV2024)

Python 134 17 Updated Aug 26, 2024

xcyao00 / PRNet

[CVPR 2023] Unofficial PyTorch implementation for CVPR2023 paper, Prototypical Residual Networks for Anomaly Detection and Localization.

Python 38 4 Updated Jul 7, 2023

haidog-yaqub / DiffPitcher

Diffusion-based singing voice pitch correction

Python 138 20 Updated Sep 20, 2024

sannawag / data_driven_pitch_corrector

Python 164 32 Updated Jun 26, 2021

pfnet-research / sngan_projection

GANs with spectral normalization and projection discriminator

Python 1,101 202 Updated Nov 12, 2019

xrli-U / MuSc

This is an official PyTorch implementation for "MuSc : Zero-Shot Industrial Anomaly Classification and Segmentation with Mutual Scoring of the Unlabeled Images" (MuSc ICLR2024).

Python 432 37 Updated Apr 11, 2024

jzbontar / pixelcnn-pytorch

Python 123 30 Updated Mar 18, 2020

jik876 / hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Python 2,332 552 Updated Jul 27, 2024

jik876 / hifi-gan-demo

Audio samples from "HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis"

HTML 10 4 Updated Oct 28, 2020

ygjwd12345 / TransDepth

Code for Transformers Solve Limited Receptive Field for Monocular Depth Prediction

Python 175 20 Updated May 12, 2023

navamikairanda / R2U-Net

Pytorch Implementation of "Recurrent Residual Convolutional Neural Network based on U-Net (R2U-Net) for Medical Image Segmentation" paper on cityscapes dataset

Python 34 9 Updated Mar 31, 2021