Mistral Unveils Open-Weight Mistral 3 Models
Mistral AI announced the open-weight Mistral 3 family: a 675B-parameter Large 3 model and three smaller variants at 14B, 8B, and 3B, all released under the Apache 2.0 license, which permits commercial use and fine-tuning. Large 3 uses a sparse Mixture-of-Experts (MoE) architecture with 41B active parameters, a 256k-token context window, and a vision encoder, while the smaller models target cost-efficient self-hosting on GPUs as small as the RTX 4000 Ada.
Key Points
- Announces four Apache 2.0 open-weight models: the 675B Large 3 and 14B, 8B, and 3B variants.
- Large 3 uses a sparse MoE architecture with 41B active parameters, a 256k-token context window, a vision encoder, and function calling.
- Enables self-hosting and fine-tuning, with deployment targets ranging from 8x H200 clusters down to a single RTX 4000 Ada.
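The parameter counts and hardware range above lend themselves to a quick sanity check. The following is a rough sketch, not a sizing guide: it estimates weight memory only (no KV cache, activations, or runtime overhead). The bytes-per-parameter table, the 141 GB figure for an H200, and the 20 GB figure for an RTX 4000 Ada are assumptions brought in for illustration rather than figures from the announcement, and the sketch does not assume which precision the released checkpoints actually use.

```python
# Back-of-the-envelope weight-memory estimate.
# Weights only: ignores KV cache, activations, and framework overhead.

# Assumed storage formats (bytes per parameter); not taken from the announcement.
BYTES_PER_PARAM = {"bf16": 2.0, "fp8": 1.0, "int4": 0.5}

# Figures quoted in the summary above (billions of parameters).
LARGE_TOTAL_B = 675
LARGE_ACTIVE_B = 41
SMALL_MODELS_B = [14, 8, 3]

# Assumed per-GPU memory in GB (hardware specs, not from the announcement).
H200_GB = 141
RTX_4000_ADA_GB = 20


def weight_gb(params_billions: float, precision: str) -> float:
    """Approximate weight size in GB, taking 1 GB = 1e9 bytes."""
    return params_billions * BYTES_PER_PARAM[precision]


def active_fraction(active_billions: float, total_billions: float) -> float:
    """Share of a sparse MoE model's parameters used per token."""
    return active_billions / total_billions


def fits(params_billions: float, precision: str, memory_gb: float) -> bool:
    """True if the weights alone fit in the given memory budget."""
    return weight_gb(params_billions, precision) <= memory_gb


share = active_fraction(LARGE_ACTIVE_B, LARGE_TOTAL_B)
print(f"Large 3 active share per token: {share:.1%}")

cluster_gb = 8 * H200_GB
for precision in BYTES_PER_PARAM:
    size = weight_gb(LARGE_TOTAL_B, precision)
    ok = fits(LARGE_TOTAL_B, precision, cluster_gb)
    print(f"Large 3 @ {precision}: {size:.0f} GB, fits 8x H200 ({cluster_gb} GB): {ok}")

for model_b in SMALL_MODELS_B:
    for precision in BYTES_PER_PARAM:
        ok = fits(model_b, precision, RTX_4000_ADA_GB)
        print(f"{model_b}B @ {precision}: {weight_gb(model_b, precision):.1f} GB, "
              f"fits RTX 4000 Ada ({RTX_4000_ADA_GB} GB): {ok}")
```

Note that a sparse MoE reduces compute per token (only the active experts run), but all 675B parameters still have to be held in memory, which is why the sketch sizes Large 3 by its total rather than its active parameter count.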
Scoring Rationale
An official open-weight release with broad deployability and tooling guidance; its impact is limited by the crowded field of large open models.