vvwangvv

Wei Wang vvwangvv

SJTU SpeechLab ASR

25 followers · 9 following

Achievements

Highlights

Stars

39 stars written in Python

Clear filter

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 58,709 6,421 Updated Apr 30, 2026

run-llama / llama_index

LlamaIndex is the leading document agent and OCR platform

Python 50,152 7,566 Updated Jun 15, 2026

gradio-app / gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 42,927 3,488 Updated Jun 15, 2026

hpcaitech / ColossalAI

Making large AI models cheaper, faster and more accessible

Python 41,398 4,512 Updated May 25, 2026

satwikkansal / wtfpython

What the f*ck Python? 😱

Python 36,990 2,669 Updated Jan 13, 2026

OpenBMB / VoxCPM

VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning

Python 29,725 3,366 Updated Jun 10, 2026

BlinkDL / RWKV-LM

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 14,563 1,007 Updated Jun 13, 2026

snakers4 / silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 9,333 785 Updated Mar 26, 2026

OpenTalker / video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Python 7,255 1,061 Updated Aug 5, 2024

OpenBMB / ToolBench

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.

Python 5,669 485 Updated May 21, 2025

davidhalter / jedi-vim

Using the jedi autocompletion library for VIM.

Python 5,310 368 Updated May 4, 2026

Zejun-Yang / AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Python 5,019 619 Updated Jul 2, 2024

KoljaB / RealtimeTTS

Converts text to speech in realtime

Python 3,954 399 Updated May 31, 2026

facebookresearch / sam-audio

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 3,533 319 Updated May 26, 2026

OpenGVLab / InternGPT

InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editin…

Python 3,207 234 Updated Aug 20, 2024