Skip to content
View tan-xu's full-sized avatar

Block or report tan-xu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Open-Source Frontier Voice AI

Python 19,074 2,110 Updated Dec 17, 2025

NVIDIA Isaac GR00T N1.6 - A Foundation Model for Generalist Robots.

Jupyter Notebook 5,697 897 Updated Dec 18, 2025

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 4,419 322 Updated Jun 21, 2025

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 158,308 14,013 Updated Dec 24, 2025

Ongoing research training transformer models at scale

Python 14,726 3,417 Updated Dec 26, 2025

Grok open release

Python 50,567 8,372 Updated Aug 30, 2024

✨✨Latest Advances on Multimodal Large Language Models

17,064 1,098 Updated Dec 26, 2025

Fast and memory-efficient exact attention

Python 21,309 2,250 Updated Dec 26, 2025

ChatGPT, GenerativeAI and LLMs Timeline

954 58 Updated May 19, 2024

Muzic: Music Understanding and Generation with Artificial Intelligence

Python 4,888 495 Updated Oct 12, 2024

A straightforward collection of Music Generation research resources.

601 37 Updated Jan 20, 2025

A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)

Python 475 65 Updated Feb 7, 2024

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python 1,332 105 Updated Sep 24, 2023

Official repo for consistency models.

Python 6,458 437 Updated Mar 22, 2024

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Python 24,496 2,050 Updated Jul 29, 2025

Making large AI models cheaper, faster and more accessible

Python 41,309 4,544 Updated Dec 22, 2025

翻墙、免费翻墙、免费科学上网、免费节点、免费梯子、免费ss/v2ray/trojan节点、蓝灯、谷歌商店、翻墙梯子

38,951 5,682 Updated Aug 20, 2024

深度学习经典、新论文逐段精读

32,236 2,764 Updated Mar 22, 2025

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Python 1,029 172 Updated Jul 5, 2023
Python 1,455 185 Updated Feb 11, 2024

Code release for NeRF (Neural Radiance Fields)

Jupyter Notebook 10,734 1,444 Updated Apr 12, 2025

A curated list of awesome neural radiance fields papers

TeX 6,745 601 Updated Jan 6, 2025

ICASSP 2022

SCSS 61 3 Updated Oct 12, 2021

Multi-Agent Resource Optimization (MARO) platform is an instance of Reinforcement Learning as a Service (RaaS) for real-world resource optimization problems.

Python 905 160 Updated Apr 24, 2025

A repository for generating stylized talking 3D and 3D face

Python 279 37 Updated Nov 11, 2021

A python package to analyze and compare voices with deep learning

Python 3,189 474 Updated Oct 12, 2023

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 36,840 5,270 Updated Nov 13, 2025

Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.

C# 996 542 Updated Dec 26, 2025
Next