Stars
A latent text-to-image diffusion model
🔊 Text-Prompted Generative Audio Model
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Inpaint anything using Segment Anything and inpainting models.
This is the official code for the MobileSAM project, which makes SAM lightweight for mobile applications and beyond!
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Everything you need to know to build your own RAG application
Bark Voice Cloning and Voice Cloning for Chinese Speech
Intelligent multilingual AI dubbing and translation tool for video - Linly-Dubbing: “AI-empowered, language without borders”