Stars
Fully automatic conversion of audio to VMD lip data, with numerous features and an easy 1-click installer, so you can lip-sync your MMD models to any song or speech.
This is a speech interaction system built on an open-source model, integrating ASR, LLM, and TTS in sequence. The ASR model is SenseVoice, the LLM models are Qwen2.5-0.5B/1.5B, and there are three …
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch