Skip to content
View vvwangvv's full-sized avatar

Highlights

  • Pro

Block or report vvwangvv

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
39 stars written in Python
Clear filter

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 58,709 6,421 Updated Apr 30, 2026

LlamaIndex is the leading document agent and OCR platform

Python 50,152 7,566 Updated Jun 15, 2026

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 42,927 3,488 Updated Jun 15, 2026

Making large AI models cheaper, faster and more accessible

Python 41,398 4,512 Updated May 25, 2026

What the f*ck Python? 😱

Python 36,990 2,669 Updated Jan 13, 2026

VoxCPM2: Tokenizer-Free TTS for Multilingual Speech Generation, Creative Voice Design, and True-to-Life Cloning

Python 29,725 3,366 Updated Jun 10, 2026

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 14,563 1,007 Updated Jun 13, 2026

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 9,333 785 Updated Mar 26, 2026

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Python 7,255 1,061 Updated Aug 5, 2024

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.

Python 5,669 485 Updated May 21, 2025

Using the jedi autocompletion library for VIM.

Python 5,310 368 Updated May 4, 2026

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Python 5,019 619 Updated Jul 2, 2024

Converts text to speech in realtime

Python 3,954 399 Updated May 31, 2026

The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…

Python 3,533 319 Updated May 26, 2026

InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editin…

Python 3,207 234 Updated Aug 20, 2024

Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages

Python 2,838 254 Updated Dec 30, 2025

GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code

Python 1,809 255 Updated Oct 18, 2024

Chat language model that can use tools and interpret the results

Python 1,595 119 Updated Dec 3, 2025

Some Conferences' accepted paper lists (including AI, ML, Robotic)

Python 1,337 84 Updated Jan 23, 2025

[ICLR 2025 Oral] TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation

Python 1,156 150 Updated Aug 24, 2025

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code

Python 1,091 132 Updated Oct 18, 2024

Panda项目是于2023年5月启动的开源海外中文大语言模型项目,致力于大模型时代探索整个技术栈,旨在推动中文自然语言处理领域的创新和合作。

Python 1,034 76 Updated Oct 19, 2023

MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes; NeurIPS 2024; Official code

Python 823 108 Updated Oct 16, 2024

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation

Python 757 106 Updated May 12, 2026

Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice

Python 511 68 Updated Dec 22, 2025

Official repository for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation

Python 489 34 Updated Apr 15, 2024

Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networks

Python 466 82 Updated Jun 3, 2020

Versatile Evaluation of Speech and Audio

Python 416 48 Updated May 29, 2026

the missing toolbox for an async world

Python 370 28 Updated Jun 14, 2026
Next