Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities

Talismanic/Kimi-VL

 
 


I have acquired and analyzed four VLM repositories: QwenLM's Qwen2.5-VL, deepseek-ai's DeepSeek-VL2, rhymes-ai's Aria, and Moonshot AI's Kimi-VL. Taken together, they show how quickly multimodal AI models are advancing.

  1. Related project: deepseek-ai/DeepSeek-VL2
  2. Related project: rhymes-ai/Aria
  3. Related project: QwenLM/Qwen2.5-VL
