tan-xu

Follow

Xu Tan (谭旭) tan-xu

Follow

ex Principal Researcher and Research Manager at Microsoft Research Asia, working on LLMs, multimodality, and generative AI for video and audio.

810 followers · 0 following

Beijing, China
https://tan-xu.github.io/
https://scholar.google.com/citations?user=tob-U1oAAAAJ
@xutan_tx

Achievements

Achievements

Stars

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 384,385 80,751 Updated Jul 28, 2026

microsoft / VibeVoice

Open-Source Frontier Voice AI

Python 50,630 5,654 Updated Jul 24, 2026

NVIDIA / Isaac-GR00T

NVIDIA Isaac GR00T N1.7 - A Foundation Model for Generalist Robots.

Python 7,695 1,372 Updated Jul 22, 2026

FreesiaGPT / Embodied-AI

68 4 Updated Mar 20, 2026

MoonshotAI / Kimi-Audio

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 4,694 370 Updated Jun 21, 2025

ollama / ollama

Get up and running with Kimi-K2.6, GLM-5.2, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.

Go 177,066 17,143 Updated Jul 28, 2026

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python 17,239 4,293 Updated Jul 28, 2026

xai-org / grok-1

Grok open release

Python 52,075 8,527 Updated Aug 30, 2024

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

17,959 1,130 Updated Jul 28, 2026

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 24,554 2,943 Updated Jul 27, 2026

hollobit / GenAI_LLM_timeline

ChatGPT, GenerativeAI and LLMs Timeline

953 58 Updated May 19, 2024

microsoft / muzic

Muzic: Music Understanding and Generation with Artificial Intelligence

Python 4,939 502 Updated Oct 12, 2024

AI-Guru / music-generation-research

A straightforward collection of Music Generation research resources.

606 37 Updated Jan 20, 2025

heatz123 / naturalspeech

A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)

Python 475 63 Updated Feb 7, 2024

lucidrains / naturalspeech2-pytorch

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python 1,333 104 Updated Sep 24, 2023

openai / consistency_models

Official repo for consistency models.

Python 6,492 434 Updated Mar 22, 2024

microsoft / JARVIS

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Python 25,107 2,192 Updated Jul 29, 2025

hpcaitech / ColossalAI

Making large AI models cheaper, faster and more accessible

Python 41,426 4,504 Updated Jul 13, 2026

freefq / free

翻墙、免费翻墙、免费科学上网、免费节点、免费梯子、免费ss/v2ray/trojan节点、蓝灯、谷歌商店、翻墙梯子

41,365 5,624 Updated Aug 20, 2024

mli / paper-reading

深度学习经典、新论文逐段精读

33,623 2,811 Updated Mar 22, 2025

SpeechResearch / speechresearch.github.io

HTML 44 5 Updated Jun 10, 2024

aliutkus / speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Python 1,050 172 Updated Jul 5, 2023

microsoft / NeuralSpeech

Python 1,461 184 Updated Feb 11, 2024

bmild / nerf

Code release for NeRF (Neural Radiance Fields)

Jupyter Notebook 10,919 1,436 Updated Apr 12, 2025

awesome-NeRF / awesome-NeRF

A curated list of awesome neural radiance fields papers

TeX 6,777 599 Updated Jan 6, 2025

jerrygood0703 / KaraSinger

ICASSP 2022

SCSS 61 3 Updated Oct 12, 2021

microsoft / maro

Multi-Agent Resource Optimization (MARO) platform is an instance of Reinforcement Learning as a Service (RaaS) for real-world resource optimization problems.

Python 921 160 Updated Apr 24, 2025

wuhaozhe / style_avatar

A repository for generating stylized talking 3D and 3D face

Python 278 35 Updated Nov 11, 2021

resemble-ai / Resemblyzer

A python package to analyze and compare voices with deep learning

Python 3,291 486 Updated Oct 12, 2023

babysor / MockingBird

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 36,918 5,196 Updated Mar 3, 2026