huutuongtu

Follow

😀

Huh?

Huu Tuong Tu huutuongtu

😀

Huh?

Follow

Strygwyr

16 followers · 62 following

Vietnam

Achievements

Achievements

Lists (16)

Sort

Aligner

Aligner for TTS, ASR, ...

Audio Enhancement

DATASET

improve_model_architecture

15 repositories

Interactive AI

MDD

MLOPS

SE

Singing Voice

Speaker Diarization

Speech LLM

14 repositories

Speech quality assessment

Speech Separation

Speech Tokenizer

10 repositories

Tool

trader

Stars

19 results for sponsorable starred repositories

souzatharsis / podcastfy

An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI

Python 5,585 648 Updated Oct 31, 2025

PatrickJS / awesome-cursorrules

📄 Configuration files that enhance Cursor AI editor experience with custom rules and behaviors

MDX 35,181 2,989 Updated Oct 24, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,295 11,071 Updated Nov 6, 2025

hacksider / Deep-Live-Cam

real time face swap and one-click video deepfake with only a single image

Python 75,344 10,964 Updated Nov 5, 2025

ChenghaoMou / embeddings

zero-vocab or low-vocab embeddings

Python 18 1 Updated Jul 17, 2022

lifeiteng / OmniSenseVoice

Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯

Python 873 40 Updated Oct 28, 2025

YaoFANGUK / video-subtitle-extractor

视频硬字幕提取，生成srt文件。无需申请第三方API，本地实现文本识别。基于深度学习的视频字幕提取框架，包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.

Python 8,001 829 Updated Aug 21, 2025

davidteather / TikTok-Api

The Unofficial TikTok API Wrapper In Python

Python 5,870 1,112 Updated Oct 14, 2025

archinetai / aligner-pytorch

Sequence alignement methods with helpers for PyTorch.

Python 24 3 Updated Nov 30, 2022

lifeiteng / naturalspeech3_facodec

FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3

Python 220 21 Updated Apr 20, 2024

voidful / Codec-SUPERB

Audio Codec Speech processing Universal PERformance Benchmark

Python 275 26 Updated Jul 2, 2025

haoheliu / AudioLDM-training-finetuning

AudioLDM training, finetuning, evaluation and inference.

Python 278 54 Updated Dec 13, 2024

camenduru / MusicGen-colab

Jupyter Notebook 546 66 Updated Jul 25, 2023

archinetai / audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Python 2,080 178 Updated Jun 12, 2023

serp-ai / bark-with-voice-clone

Forked from suno-ai/bark

🔊 Text-prompted Generative Audio Model - With the ability to clone voices

Jupyter Notebook 3,332 451 Updated Aug 24, 2025

lifeiteng / vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python 2,181 333 Updated Sep 10, 2025

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 18,605 1,970 Updated Oct 21, 2025

philipperemy / timit

The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.

313 154 Updated Mar 5, 2022

trekhleb / javascript-algorithms

📝 Algorithms and data structures implemented in JavaScript with explanations and links to further readings

JavaScript 193,892 30,940 Updated Oct 22, 2025