JetBrains introduces Mellum2, a fast 12B AI model that excels in software engineering

A year after introducing its first open AI code generation model, JetBrains has released Mellum2, a 12-billion-parameter AI model focused on code generation and efficiency in production environments. Developed from scratch and distributed under the Apache 2.0 open-source license, Mellum2 offers organizations a cost-effective alternative to proprietary large language models. While it builds on the original Mellum model's code completion foundation, Mellum2 extends its capabilities to cover both natural language and code. This allows the model to assist with routing, summarization, orchestration, and intermediate reasoning tasks within modern AI workflows. The model uses a Mixture-of-Experts (MoE) architecture that activates only 2.5 billion of its 12 billion parameters per token. This reduces compute requirements and supports high-throughput, low-latency inference, making Mellum2 suitable for real-time applications at scale. Mellum2 is not multimodal and has been tr...
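To make the Mixture-of-Experts point concrete, here is a minimal sketch of generic top-k expert routing in Python. It is not Mellum2's implementation: the article does not state the number of experts, the top-k value, or the routing scheme, so `NUM_EXPERTS`, `TOP_K`, the toy scalar experts, and the function names are all hypothetical. The only figures taken from the article are the 2.5 billion active and 12 billion total parameters.

```python
import math
import random


def top_k_route(router_logits, k):
    """Return (expert_index, weight) pairs for the k highest-scoring experts."""
    top = sorted(range(len(router_logits)),
                 key=lambda i: router_logits[i], reverse=True)[:k]
    # Softmax over the selected logits only, so the k weights sum to 1.
    peak = max(router_logits[i] for i in top)
    exps = [math.exp(router_logits[i] - peak) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]


def moe_forward(x, experts, router_logits, k):
    """Combine the outputs of the k routed experts; the others are never called."""
    return sum(w * experts[i](x) for i, w in top_k_route(router_logits, k))


# Hypothetical toy setup (assumption, not from the article):
# 8 scalar "experts", 2 of them active per token.
NUM_EXPERTS, TOP_K = 8, 2
experts = [lambda x, s=s: s * x for s in range(1, NUM_EXPERTS + 1)]

rng = random.Random(0)
logits = [rng.gauss(0, 1) for _ in range(NUM_EXPERTS)]  # stand-in router scores
routes = top_k_route(logits, TOP_K)
output = moe_forward(3.0, experts, logits, TOP_K)

# Share of parameters active per token, using the article's figures.
active_fraction = 2.5 / 12  # roughly 21%
```

Because only the routed experts run for a given token, per-token compute tracks the active parameter count rather than the total, which is the efficiency argument the article makes for the 2.5 billion figure.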
