Qwen2.5-Coder is the latest open-source model in Alibaba's Qwen series, focused on tasks such as code generation, inference, and repair.
Hunyuan-Large is Tencent’s recently open-sourced, large-scale Mixture of Experts (MoE) model, featuring 3.89 trillion total parameters and 52 billion active parameters.
SmolLM2 is a series of compact language models recently released by Hugging Face, designed specifically for on-device applications.
MobileLLM is a highly efficient language model launched by Meta, specifically designed for mobile devices and resource-constrained environments.
Aya Expanse, developed by Cohere For AI, is an advanced multilingual large language model designed to bridge the gap between artificial intelligence and language. The model supports 23 languages and is available in two versions: 8B (8 billion parameters) and 32B (32 billion parameters).
Founded in 2020 and headquartered in Palo Alto, California, Zyphra is a company focused on advancing artificial intelligence technologies. The company is committed to developing cutting-edge AI models and services to address challenges across industries, promoting edge intelligence, and driving the evolution of multimodal AI platforms.
NVLM 1.0 is a series of cutting-edge multimodal large language models (LLMs) launched by NVIDIA, designed to achieve state-of-the-art results in vision-language tasks.
Liquid Foundation Models (LFMs): Next-Generation Generative AI Models by Liquid AI
Molmo AI is a series of open-source multimodal artificial intelligence models developed by the Allen Institute for AI (Ai2). These models are designed to handle various types of data, including text, images, audio, and video, with broad application potential.
Mengzi GPT is a generative large language model launched by Lanzhou Technology, specializing in applications across various generation scenarios.