A year after introducing its first open AI code generation model, JetBrains has released Mellum2, a 12-billion-parameter model focused on code generation and efficiency in production environments. Developed from scratch and distributed under the Apache 2.0 open-source license, Mellum2 offers organizations a cost-effective alternative to proprietary large language models. While building on the original Mellum model's code completion foundation, Mellum2 extends its capabilities to support both natural language and code. This allows the model to assist with routing, summarization, orchestration, and intermediate reasoning tasks within modern AI workflows. The model's architecture uses a Mixture-of-Experts (MoE) design, activating only 2.5 billion of its 12 billion parameters per token. This approach reduces computing requirements and supports high-throughput, low-latency inference, making Mellum2 suitable for real-time applications at scale. Mellum2 is not multimodal and has been tr...
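To make the Mixture-of-Experts idea concrete, here is a minimal sketch of top-k expert routing in plain Python. It is not Mellum2's implementation: the article does not say how many experts the model has, how many are selected per token, or how the router is built, so the expert count (8), the value of k (2), the scalar toy experts, and the function names `route_top_k` and `moe_forward` are all assumptions made for illustration. Only the 2.5 billion and 12 billion parameter figures come from the article.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def route_top_k(router_logits, k):
    """Return (expert_index, weight) pairs for the k highest-scoring experts."""
    ranked = sorted(range(len(router_logits)),
                    key=lambda i: router_logits[i], reverse=True)
    top = ranked[:k]
    # Renormalize over the selected experts only, so the weights sum to 1.
    weights = softmax([router_logits[i] for i in top])
    return list(zip(top, weights))

def moe_forward(x, experts, router_logits, k):
    """Weighted sum of the selected experts' outputs; unselected experts never run."""
    return sum(w * experts[i](x) for i, w in route_top_k(router_logits, k))

# Hypothetical toy setup (not from the article): 8 scalar experts, 2 active per token.
experts = [lambda x, scale=scale: scale * x for scale in range(1, 9)]
router_logits = [0.1, 2.0, -1.0, 0.5, 3.0, 0.0, -0.5, 1.0]

selected = route_top_k(router_logits, k=2)
output = moe_forward(1.0, experts, router_logits, k=2)

# Figures from the article: 2.5B parameters active per token out of 12B total.
active_fraction = 2.5 / 12

print("selected experts:", [i for i, _ in selected])
print(f"active parameter fraction: {active_fraction:.1%}")
```

In this toy run only two of the eight experts execute for the token, which is the source of the compute savings the article describes: with the article's figures, roughly a fifth of the model's parameters take part in each token's forward pass.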