huutuongtu

😀

Huh?

Huu Tuong Tu huutuongtu

😀

Huh?

Strygwyr

16 followers · 62 following

Vietnam

Achievements

Lists (16)

Sort

Stars

289 stars written in Python

Clear filter

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 90,405 11,324 Updated Sep 8, 2025

hacksider / Deep-Live-Cam

real time face swap and one-click video deepfake with only a single image

Python 75,302 10,956 Updated Nov 5, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,200 11,052 Updated Nov 6, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 61,900 7,484 Updated Nov 5, 2025

meta-llama / llama

Inference code for Llama models

Python 58,900 9,813 Updated Jan 26, 2025

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 52,075 5,705 Updated Sep 10, 2025

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 43,319 5,733 Updated Aug 16, 2024

gradio-app / gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 40,416 3,115 Updated Nov 6, 2025

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 38,103 4,132 Updated Jul 6, 2025

huggingface / pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 35,669 5,059 Updated Nov 6, 2025

myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 35,367 3,884 Updated Apr 19, 2025

facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 31,917 6,622 Updated Sep 30, 2025

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,498 6,473 Updated Nov 6, 2025

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 29,071 3,474 Updated Jan 26, 2025

Ebazhanov / linkedin-skill-assessments-quizzes

Full reference of LinkedIn answers 2024 for skill assessments (aws-lambda, rest-api, javascript, react, git, html, jquery, mongodb, java, Go, python, machine-learning, power-point) linkedin excel t…

Python 28,659 13,133 Updated Nov 5, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,265 1,758 Updated Oct 13, 2025

pyg-team / pytorch_geometric

Graph Neural Network Library for PyTorch

Python 23,097 3,907 Updated Nov 3, 2025

mlflow / mlflow

The open source developer platform to build AI/LLM applications and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integra…

Python 22,819 4,957 Updated Nov 6, 2025

sinaptik-ai / pandas-ai

Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.

Python 22,515 2,200 Updated Oct 28, 2025

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,812 2,663 Updated Jul 3, 2025

modelcontextprotocol / python-sdk

The official Python SDK for Model Context Protocol servers and clients

Python 19,880 2,727 Updated Nov 5, 2025

stitionai / devika

Devika is now Opcode

Python 19,490 2,612 Updated Sep 25, 2025

kaixindelele / ChatPaper

Use ChatGPT to summarize the arXiv papers. 全流程加速科研，利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复

Python 19,099 1,949 Updated Apr 4, 2024

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 18,594 1,969 Updated Oct 21, 2025

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 17,131 1,874 Updated Oct 21, 2025

NVIDIA-NeMo / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,044 3,177 Updated Nov 5, 2025

DrewThomasson / ebook2audiobook

Generate audiobooks from e-books, voice cloning & 1107+ languages!

Python 14,838 1,139 Updated Nov 5, 2025

resemble-ai / chatterbox

SoTA open-source TTS

Python 14,428 1,942 Updated Sep 25, 2025

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 13,537 1,988 Updated Nov 3, 2025

microsoft / LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 12,894 856 Updated Dec 17, 2024

Huu Tuong Tu huutuongtu

Lists (16)

Aligner

Audio Enhancement

DATASET

improve_model_architecture

Interactive AI

MDD

MLOPS

SE

Singing Voice

Speaker Diarization

Speech LLM

Speech quality assessment

Speech Separation

Speech Tokenizer

Tool

trader

Stars