Skip to content
View huutuongtu's full-sized avatar
😀
Huh?
😀
Huh?

Block or report huutuongtu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
289 stars written in Python
Clear filter

Robust Speech Recognition via Large-Scale Weak Supervision

Python 90,477 11,331 Updated Sep 8, 2025

real time face swap and one-click video deepfake with only a single image

Python 75,394 10,969 Updated Nov 5, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,404 11,099 Updated Nov 7, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 62,009 7,495 Updated Nov 6, 2025

Inference code for Llama models

Python 58,905 9,812 Updated Jan 26, 2025

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 52,109 5,708 Updated Sep 10, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 43,336 5,741 Updated Aug 16, 2024

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 40,436 3,117 Updated Nov 7, 2025

A generative speech model for daily dialogue.

Python 38,117 4,132 Updated Jul 6, 2025

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 35,687 5,061 Updated Nov 6, 2025

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 35,386 3,889 Updated Apr 19, 2025

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 31,922 6,623 Updated Sep 30, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,511 6,478 Updated Nov 7, 2025

The official Meta Llama 3 GitHub site

Python 29,072 3,477 Updated Jan 26, 2025

Full reference of LinkedIn answers 2024 for skill assessments (aws-lambda, rest-api, javascript, react, git, html, jquery, mongodb, java, Go, python, machine-learning, power-point) linkedin excel t…

Python 28,659 13,128 Updated Nov 5, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,276 1,764 Updated Oct 13, 2025

Graph Neural Network Library for PyTorch

Python 23,104 3,911 Updated Nov 7, 2025

The open source developer platform to build AI/LLM applications and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integra…

Python 22,841 4,964 Updated Nov 7, 2025

Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.

Python 22,526 2,200 Updated Oct 28, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,817 2,665 Updated Jul 3, 2025

The official Python SDK for Model Context Protocol servers and clients

Python 19,908 2,734 Updated Nov 6, 2025

Devika is now Opcode

Python 19,491 2,612 Updated Sep 25, 2025

Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复

Python 19,102 1,949 Updated Apr 4, 2024

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 18,619 1,975 Updated Oct 21, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 17,158 1,876 Updated Oct 21, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,056 3,181 Updated Nov 6, 2025

Generate audiobooks from e-books, voice cloning & 1107+ languages!

Python 14,932 1,149 Updated Nov 5, 2025

SoTA open-source TTS

Python 14,444 1,949 Updated Sep 25, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 13,542 1,987 Updated Nov 3, 2025

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 12,902 856 Updated Dec 17, 2024
Next