Aya Vision is a set of advanced vision-language models designed to address multilingual performance challenges in multimodal AI systems.
Janus-Pro is a multimodal AI model recently released by the DeepSeek team, designed to achieve unified multimodal understanding and generation.
Kimi K1.5 is a new-generation multimodal reasoning model launched by Dark Side of the Moon, boasting powerful reasoning and multimodal processing capabilities.
The MiniMax-01 series, launched by Hailuo AI, comprises open-source large language models and vision multimodal models.
MiniCPM-o is a new series of edge-based multimodal large models designed to handle various inputs such as images, videos, text, and audio, and generate high-quality text and speech outputs.
CogAgent is a multimodal Vision-Language Model (VLM) jointly developed by Tsinghua University and Zhipu AI, designed specifically for understanding and interacting with graphical user interfaces (GUIs).