-
minicpm-o2.6
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
vision 8b26.5K Pulls 13 Tags Updated 11 months ago
-
minicpm-v4.5
A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone
vision 8b17.2K Pulls 11 Tags Updated 8 months ago
-
minicpm-o4.5
A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Mulitmodal Live Streaming on Your Phone
vision 8b6,669 Pulls 12 Tags Updated 3 months ago
-
minicpm-v4.6
A Pocket-Sized MLLM for Ultra-Efficient Image and Video Understanding on Your Phone
vision3,246 Pulls 12 Tags Updated 6 days ago
-
minicpm-v2.6
A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
vision 8b2,359 Pulls 12 Tags Updated 11 months ago
-
minicpm-v4
A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
vision 4b1,725 Pulls 12 Tags Updated 9 months ago
-
minicpm4.1
highly efficient large language models (LLMs) designed explicitly for end-side devices
1,237 Pulls 1 Tag Updated 8 months ago
-
minicpm-v2.5
A GPT-4V Level Multimodal LLM on Your Phone
vision 8b391 Pulls 13 Tags Updated 11 months ago