Skip to content
View huutuongtu's full-sized avatar
😀
Huh?
😀
Huh?

Block or report huutuongtu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
19 results for sponsorable starred repositories
Clear filter

An Open Source Python alternative to NotebookLM's podcast feature: Transforming Multimodal Content into Captivating Multilingual Audio Conversations with GenAI

Python 5,583 648 Updated Oct 31, 2025

📄 Configuration files that enhance Cursor AI editor experience with custom rules and behaviors

MDX 35,158 2,983 Updated Oct 24, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,144 11,045 Updated Nov 5, 2025

real time face swap and one-click video deepfake with only a single image

Python 75,282 10,954 Updated Nov 5, 2025

zero-vocab or low-vocab embeddings

Python 18 1 Updated Jul 17, 2022

Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯

Python 873 40 Updated Oct 28, 2025

视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.

Python 7,999 827 Updated Aug 21, 2025

The Unofficial TikTok API Wrapper In Python

Python 5,865 1,113 Updated Oct 14, 2025

Sequence alignement methods with helpers for PyTorch.

Python 24 3 Updated Nov 30, 2022

FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3

Python 220 21 Updated Apr 20, 2024

Audio Codec Speech processing Universal PERformance Benchmark

Python 275 26 Updated Jul 2, 2025

AudioLDM training, finetuning, evaluation and inference.

Python 278 54 Updated Dec 13, 2024
Jupyter Notebook 546 66 Updated Jul 25, 2023

Audio generation using diffusion models, in PyTorch.

Python 2,080 178 Updated Jun 12, 2023

🔊 Text-prompted Generative Audio Model - With the ability to clone voices

Jupyter Notebook 3,332 451 Updated Aug 24, 2025

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python 2,182 333 Updated Sep 10, 2025

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 18,590 1,968 Updated Oct 21, 2025

The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.

313 154 Updated Mar 5, 2022

📝 Algorithms and data structures implemented in JavaScript with explanations and links to further readings

JavaScript 193,882 30,939 Updated Oct 22, 2025