Qwen2.5-Omni is an end-to-end multimodal AI model released by Alibaba, designed for comprehensive multimodal perception. It accepts text, image, audio, and video inputs.
Qwen2.5-VL-32B is a multimodal vision-language model released by Alibaba, featuring 32 billion parameters. It excels in tasks such as image understanding, mathematical reasoning, and text generation.
Reka Flash 3 is a newly released multimodal language model with 21 billion parameters, designed for efficient reasoning and generation.
Step-Video-TI2V is an advanced text-driven image-to-video generation model capable of producing videos up to 102 frames based on text descriptions and image inputs.
EXAONE Deep is a series of reasoning-enhanced language models launched by LG AI Research, designed to improve reasoning capabilities in fields such as mathematics, science, and programming.
Mistral Small 3.1 is an open-source multimodal AI model released by the French startup Mistral AI. It features 24 billion parameters and supports both text and image processing.
Command A is a large language model with 111 billion parameters, optimized for enterprises requiring fast, secure, and high-quality AI solutions.
Jamba 1.6 is a new open model released by AI21 Labs, designed for efficient enterprise AI solutions.