Stars
A latent text-to-image diffusion model
🔊 Text-Prompted Generative Audio Model
A simple screen-parsing tool towards a pure vision-based GUI agent
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Convert AI papers to GUI. Make it easy and convenient for everyone to use cutting-edge artificial intelligence technology.
MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.