Starred repositories
Open-source, low-cost 10.5 GHz PLFM phased array RADAR system
A generative speech model for daily dialogue.
Zero-Shot Speech Editing and Text-to-Speech in the Wild
This repository contains everything you need to become proficient in PyTorch
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Easily train a good VC model with voice data <= 10 mins!
PaddlePaddle GAN library, including lots of interesting applications like First-Order motion transfer, Wav2Lip, picture repair, image editing, photo2cartoon, image style transfer, GPEN, and so on.
SoftVC VITS Singing Voice Conversion
Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person
[CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
Training transferable end-to-end quadrotor control policies on a laptop in 18 seconds.
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
mhe014 / ChatGPT-on-WeChat
Forked from kx-Huang/ChatGPT-on-WeChat
🤖️ Deploy ChatGPT on your WeChat within 2 steps! (Deploy your WeChat ChatGPT chatbot to the cloud in two steps!) 🤖️
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
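WeChat hook and WeChat bot (wxhook), database decryption, WeChat official account data collection, WeChat official account crawler, WeCom (Enterprise WeChat) hook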
We write your reusable computer vision tools. 💜
WebRTC and ORTC implementation for Python using asyncio
Minimal and clean examples of machine learning algorithm implementations
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)
Reference code for "Motion-supervised Co-Part Segmentation" paper
Text2Cinemagraph: Text-Guided Synthesis of Eulerian Cinemagraphs [SIGGRAPH ASIA 2023]
serp-ai / bark-with-voice-clone
Forked from suno-ai/bark
🔊 Text-prompted Generative Audio Model - With the ability to clone voices