-
minicpm-o2.6
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
vision 8b22.7K Pulls 13 Tags Updated 8 months ago
-
minicpm-v4.5
A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone
vision 8b10.7K Pulls 11 Tags Updated 5 months ago
-
minicpm-o4.5
A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Mulitmodal Live Streaming on Your Phone
vision 8b2,096 Pulls 12 Tags Updated 1 week ago
-
minicpm-v2.6
A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
vision 8b1,583 Pulls 12 Tags Updated 8 months ago
-
minicpm-v4
A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
vision 4b921 Pulls 12 Tags Updated 5 months ago
-
minicpm4.1
highly efficient large language models (LLMs) designed explicitly for end-side devices
411 Pulls 1 Tag Updated 5 months ago
-
minicpm-v2.5
A GPT-4V Level Multimodal LLM on Your Phone
vision 8b272 Pulls 13 Tags Updated 8 months ago