Skip to content
View huutuongtu's full-sized avatar
😀
Huh?
😀
Huh?

Block or report huutuongtu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
289 stars written in Python
Clear filter

Robust Speech Recognition via Large-Scale Weak Supervision

Python 90,405 11,324 Updated Sep 8, 2025

real time face swap and one-click video deepfake with only a single image

Python 75,302 10,956 Updated Nov 5, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,200 11,052 Updated Nov 6, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 61,900 7,484 Updated Nov 5, 2025

Inference code for Llama models

Python 58,900 9,813 Updated Jan 26, 2025

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 52,075 5,705 Updated Sep 10, 2025

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 43,319 5,733 Updated Aug 16, 2024

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 40,416 3,115 Updated Nov 6, 2025

A generative speech model for daily dialogue.

Python 38,103 4,132 Updated Jul 6, 2025

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 35,669 5,059 Updated Nov 6, 2025

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 35,367 3,884 Updated Apr 19, 2025

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 31,917 6,622 Updated Sep 30, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,498 6,473 Updated Nov 6, 2025

The official Meta Llama 3 GitHub site

Python 29,071 3,474 Updated Jan 26, 2025

Full reference of LinkedIn answers 2024 for skill assessments (aws-lambda, rest-api, javascript, react, git, html, jquery, mongodb, java, Go, python, machine-learning, power-point) linkedin excel t…

Python 28,659 13,133 Updated Nov 5, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,265 1,758 Updated Oct 13, 2025

Graph Neural Network Library for PyTorch

Python 23,097 3,907 Updated Nov 3, 2025

The open source developer platform to build AI/LLM applications and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integra…

Python 22,819 4,957 Updated Nov 6, 2025

Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.

Python 22,515 2,200 Updated Oct 28, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,812 2,663 Updated Jul 3, 2025

The official Python SDK for Model Context Protocol servers and clients

Python 19,880 2,727 Updated Nov 5, 2025

Devika is now Opcode

Python 19,490 2,612 Updated Sep 25, 2025

Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复

Python 19,099 1,949 Updated Apr 4, 2024

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 18,594 1,969 Updated Oct 21, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 17,131 1,874 Updated Oct 21, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,044 3,177 Updated Nov 5, 2025

Generate audiobooks from e-books, voice cloning & 1107+ languages!

Python 14,838 1,139 Updated Nov 5, 2025

SoTA open-source TTS

Python 14,428 1,942 Updated Sep 25, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 13,537 1,988 Updated Nov 3, 2025

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 12,894 856 Updated Dec 17, 2024
Next