vvwangvv

Wei Wang vvwangvv

SJTU SpeechLab ASR

25 followers · 9 following

Achievements

Highlights

Stars

vvwangvv / URGENT-MOS

Python 4 Updated May 3, 2026

yfyeung / icefall

Forked from k2-fsa/icefall

Python 7 Updated Jun 12, 2026

facebookresearch / sam-audio

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 3,533 319 Updated May 26, 2026

wavlab-speech / versa

Versatile Evaluation of Speech and Audio

Python 416 48 Updated May 29, 2026

urgent-challenge / urgent2026_challenge_track2

Official baseline for ICASSP 2026 URGENT Challenge Track 2 (Speech Quality Assessment)

Python 31 3 Updated Jun 8, 2026

facebookresearch / omnilingual-asr

Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages

Python 2,838 254 Updated Dec 30, 2025

nethermanpro / simulmega

Python 11 1 Updated Oct 23, 2025

OpenBMB / VoxCPM

VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning

Python 29,696 3,360 Updated Jun 10, 2026

maxfischer2781 / asyncstdlib

the missing toolbox for an async world

Python 370 28 Updated Jun 14, 2026

BytedanceSpeech / seed-tts-eval

Python 1,566 147 Updated Jun 14, 2024

xingchensong / S3Tokenizer

Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice

Python 511 68 Updated Dec 22, 2025

quchangle1 / LLM-Tool-Survey

This is the repository for the Tool Learning survey.

484 15 Updated Aug 9, 2025

yerfor / MimicTalk

MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes; NeurIPS 2024; Official code

Python 823 108 Updated Oct 16, 2024

CyberAgentAILab / TANGO

[ICLR 2025 Oral] TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation

Python 1,156 150 Updated Aug 24, 2025

defog-ai / sqlcoder

SoTA LLM for converting natural language questions to SQL queries

Jupyter Notebook 4,035 278 Updated May 23, 2024

run-llama / llama_index

LlamaIndex is the leading document agent and OCR platform

Python 50,150 7,565 Updated Jun 12, 2026

MeetKai / functionary

Chat language model that can use tools and interpret the results

Python 1,595 119 Updated Dec 3, 2025

OpenBMB / ToolBench

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.

Python 5,669 485 Updated May 21, 2025

Emrys365 / DNS_text

Transcripts of the DNS Challenge test sets

7 Updated Jul 7, 2023

Zejun-Yang / AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Python 5,019 619 Updated Jul 2, 2024

OpenTalker / video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Python 7,255 1,061 Updated Aug 5, 2024

X-LANCE / AniTalker

[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"

Jupyter Notebook 1,602 144 Updated Aug 15, 2024

rhasspy / piper

A fast, local neural text to speech system

C++ 11,101 1,025 Updated Aug 26, 2025

lifeiteng / naturalspeech3_facodec

FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3

Python 247 23 Updated Apr 20, 2024

MStypulkowski / diffused-heads

Official repository for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation

Python 489 34 Updated Apr 15, 2024

vvwangvv / SpliceTTS

Python 2 Updated Nov 24, 2023

sp-uhh / sgmse

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation

Python 757 106 Updated May 12, 2026

yang-song / score_sde

Official code for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Jupyter Notebook 1,833 230 Updated Nov 29, 2022

Ninjabrain1 / Ninjabrain-Bot

Accurate stronghold calculator for Minecraft speedrunning.

Java 752 97 Updated Jun 11, 2026

JosephPai / Awesome-Talking-Face

📖 A curated list of resources dedicated to talking face.

1,541 121 Updated Dec 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Wei Wang vvwangvv

Achievements

Achievements

Highlights

Block or report vvwangvv

Stars

vvwangvv / URGENT-MOS

yfyeung / icefall

facebookresearch / sam-audio

wavlab-speech / versa

urgent-challenge / urgent2026_challenge_track2

facebookresearch / omnilingual-asr

nethermanpro / simulmega

OpenBMB / VoxCPM

maxfischer2781 / asyncstdlib

BytedanceSpeech / seed-tts-eval

xingchensong / S3Tokenizer

quchangle1 / LLM-Tool-Survey

yerfor / MimicTalk

CyberAgentAILab / TANGO

defog-ai / sqlcoder

run-llama / llama_index

MeetKai / functionary

OpenBMB / ToolBench

Emrys365 / DNS_text

Zejun-Yang / AniPortrait

OpenTalker / video-retalking

X-LANCE / AniTalker

rhasspy / piper

lifeiteng / naturalspeech3_facodec

MStypulkowski / diffused-heads

vvwangvv / SpliceTTS

sp-uhh / sgmse

yang-song / score_sde

Ninjabrain1 / Ninjabrain-Bot

JosephPai / Awesome-Talking-Face