huutuongtu

😀

Huh?

Huu Tuong Tu huutuongtu

😀

Huh?

Strygwyr

16 followers · 62 following

Vietnam

Achievements

Lists (16)

Sort

Stars

289 stars written in Python

Clear filter

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 90,477 11,331 Updated Sep 8, 2025

hacksider / Deep-Live-Cam

real time face swap and one-click video deepfake with only a single image

Python 75,394 10,969 Updated Nov 5, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,404 11,099 Updated Nov 7, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 62,009 7,495 Updated Nov 6, 2025

meta-llama / llama

Inference code for Llama models

Python 58,905 9,812 Updated Jan 26, 2025

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 52,109 5,708 Updated Sep 10, 2025

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 43,336 5,741 Updated Aug 16, 2024

gradio-app / gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 40,436 3,117 Updated Nov 7, 2025

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 38,117 4,132 Updated Jul 6, 2025

huggingface / pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 35,687 5,061 Updated Nov 6, 2025

myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 35,386 3,889 Updated Apr 19, 2025

facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 31,922 6,623 Updated Sep 30, 2025

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,511 6,478 Updated Nov 7, 2025

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 29,072 3,477 Updated Jan 26, 2025

Ebazhanov / linkedin-skill-assessments-quizzes

Full reference of LinkedIn answers 2024 for skill assessments (aws-lambda, rest-api, javascript, react, git, html, jquery, mongodb, java, Go, python, machine-learning, power-point) linkedin excel t…

Python 28,659 13,128 Updated Nov 5, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,276 1,764 Updated Oct 13, 2025

pyg-team / pytorch_geometric

Graph Neural Network Library for PyTorch

Python 23,104 3,911 Updated Nov 7, 2025

mlflow / mlflow

The open source developer platform to build AI/LLM applications and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integra…

Python 22,841 4,964 Updated Nov 7, 2025

sinaptik-ai / pandas-ai

Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.

Python 22,526 2,200 Updated Oct 28, 2025

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,817 2,665 Updated Jul 3, 2025

modelcontextprotocol / python-sdk

The official Python SDK for Model Context Protocol servers and clients

Python 19,908 2,734 Updated Nov 6, 2025

stitionai / devika

Devika is now Opcode

Python 19,491 2,612 Updated Sep 25, 2025

kaixindelele / ChatPaper

Use ChatGPT to summarize the arXiv papers. 全流程加速科研，利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复

Python 19,102 1,949 Updated Apr 4, 2024

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 18,619 1,975 Updated Oct 21, 2025

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 17,158 1,876 Updated Oct 21, 2025

NVIDIA-NeMo / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,056 3,181 Updated Nov 6, 2025

DrewThomasson / ebook2audiobook

Generate audiobooks from e-books, voice cloning & 1107+ languages!

Python 14,932 1,149 Updated Nov 5, 2025

resemble-ai / chatterbox

SoTA open-source TTS

Python 14,444 1,949 Updated Sep 25, 2025

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 13,542 1,987 Updated Nov 3, 2025

microsoft / LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 12,902 856 Updated Dec 17, 2024

Huu Tuong Tu huutuongtu

Lists (16)

Aligner

Audio Enhancement

DATASET

improve_model_architecture

Interactive AI

MDD

MLOPS

SE

Singing Voice

Speaker Diarization

Speech LLM

Speech quality assessment

Speech Separation

Speech Tokenizer

Tool

trader

Stars