
JetBrains Launches Mellum2: A Powerful 12B Mixture-of-Experts Model for Coding

JetBrains has introduced Mellum2, a new-generation AI model built on a Mixture-of-Experts (MoE) architecture with 12 billion total parameters, optimized for software development tasks and deeply integrated into JetBrains IDEs.

Sources huggingface.co

JetBrains has officially launched Mellum2, a significant step toward building and controlling its own AI technology in-house.

What is Mellum2?

Mellum2 is a Large Language Model (LLM) built on the Mixture-of-Experts (MoE) architecture with a total of 12 billion parameters. Unlike the original Mellum, Mellum2 was trained from scratch on a massive dataset of high-quality source code and technical documentation.

Key Improvements

- Coding Performance: Mellum2 significantly outperforms its predecessor in code completion and code explanation benchmarks.
- Speed and Efficiency: The MoE architecture allows the model to activate only the necessary "experts," reducing latency and computational costs while maintaining high accuracy.
- Deep Integration: The model is designed to run seamlessly within the JetBrains IDE ecosystem via AI Assistant.
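
The "activate only the necessary experts" idea can be illustrated with a toy top-k router. This is a minimal sketch under stated assumptions, not Mellum2's actual routing: the article does not give the number of experts, the value of k, or the gating function, so the four scalar experts in `EXPERTS`, the `toy_router` function, and `k=2` are all hypothetical choices made for illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def top_k_route(router_logits, k):
    """Pick the k highest-scoring experts and renormalize their weights.

    Returns a list of (expert_index, weight) pairs, best expert first.
    """
    ranked = sorted(
        range(len(router_logits)),
        key=lambda i: router_logits[i],
        reverse=True,
    )
    chosen = ranked[:k]
    weights = softmax([router_logits[i] for i in chosen])
    return list(zip(chosen, weights))


def moe_layer(x, experts, router, k=2):
    """Run only the k selected experts and mix their outputs.

    Experts that are not selected are never called, which is the
    source of the compute savings in a sparse MoE layer.
    """
    routes = top_k_route(router(x), k)
    output = sum(weight * experts[idx](x) for idx, weight in routes)
    active = [idx for idx, _ in routes]
    return output, active


# Hypothetical toy experts: each is just a scalar function here.
# In a real model each expert would be a learned feed-forward block.
EXPERTS = [
    lambda x: x + 1.0,
    lambda x: x * 2.0,
    lambda x: x * x,
    lambda x: -x,
]


def toy_router(x):
    """Hypothetical router: one logit per expert (hand-written, not learned)."""
    return [0.1 * x, 0.5, -x, 2.0]


if __name__ == "__main__":
    out, active = moe_layer(3.0, EXPERTS, toy_router, k=2)
    print("active experts:", active)
    print("output:", out)
```

Only two of the four experts run for a given input. A production MoE layer applies the same selection per token to vectors rather than scalars, with a learned router in place of `toy_router`.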

Why It Matters

By developing its own model rather than relying solely on OpenAI or Anthropic, JetBrains reflects a broader trend: developer-tool companies are optimizing AI for their own specific contexts. Mellum2 promises a smarter and more responsive coding experience for millions of developers.