Cohere Releases Command A+ as Open Source

Cohere released Command A+, an open-source, sparse mixture-of-experts language model, and published the model weights under the Apache-2.0 license, according to VentureBeat and the Hugging Face model card. The model is reported as having 218 billion total parameters with 25 billion active parameters, a 128K context window, and support for multimodal document processing (VentureBeat; Hugging Face). BetaKit quotes Cohere co-founder Nick Frosst arguing for openness: "This tech can go one of two ways... And that only happens if things are open-sourced." BetaKit and VentureBeat note Cohere presents the release as an enterprise-grade, sovereign-AI alternative and that the weights are available on Hugging Face.
What happened
Cohere released Command A+ and published the model weights under the Apache-2.0 license, according to VentureBeat and the project's Hugging Face model card. VentureBeat and Hugging Face report the model has 218 billion total parameters with 25 billion active parameters and a 128K context length. The Hugging Face model card lists quantizations and deployment guidance and shows the model is hosted as CohereLabs/command-a-plus-05-2026-w4a4. BetaKit quotes Cohere co-founder Nick Frosst: "This tech can go one of two ways... Or it can empower the people that use it. We are working towards that second one." (BetaKit).
Technical details
Per VentureBeat and the Hugging Face model card, Command A+ is a decoder-only Sparse Mixture-of-Experts (MoE) Transformer designed for reasoning, multilingual and multimodal document tasks, and agentic workflows. The model card lists available quantizations and minimum GPU configurations for each quantization tier. VentureBeat describes the MoE design as activating about 25 billion parameters during any single generation step while the full model contains 218 billion parameters.
Editorial analysis - technical context
Sparse MoE architectures are an established pattern for reducing inference compute by routing requests to a subset of specialist subnetworks. Industry practitioners deploying MoE models typically need to plan for routing stability, load balancing across experts, and quantization testing; these operational concerns arise regardless of the specific vendor. Similarly, the availability of multiple quantizations and explicit GPU recommendations, as published on Hugging Face, accelerates practitioner evaluation and on-prem deployment workstreams.
Context and significance
Editorial analysis: Releasing frontier-weight models under permissive licenses lowers friction for enterprises and research groups that require local control, reproducibility, or regulatory alignment. Public availability of weights under Apache-2.0, combined with a 128K context window and quantization guidance, reduces barriers for self-hosting large, agentic workloads. Observers have noted (VentureBeat; BetaKit) that much of the recent open-source model momentum has been concentrated in China; Cohere's release is framed in coverage as an attempt to offer an alternative for organizations seeking open, high-performance models outside existing proprietary stacks.
What to watch
- •Adoption signals on Hugging Face (download counts, community forks) and third-party benchmarks comparing Command A+ to other open and closed models.
- •Early community reports on quantized accuracy tradeoffs and latency for recommended quantizations in real-world agentic pipelines.
- •Enterprise case studies or published integrations showing performance on multimodal document processing or long-context agent workflows.
Sources for reported facts above include VentureBeat and the Hugging Face model card for CohereLabs/command-a-plus-05-2026-w4a4, and BetaKit's coverage including a direct quote from Cohere co-founder Nick Frosst. The company blog post announcing Command A+ is also available but was used only as supplementary context in scraped snippets.
Scoring Rationale
A frontier open-source model release with permissive licensing and long-context support significantly affects practitioners evaluating self-hosting, enterprise sovereignty, and agentic workflows. The technical profile and published deployment guidance make it directly actionable for production teams.
Practice interview problems based on real data
1,500+ SQL & Python problems across 15 industry datasets — the exact type of data you work with.
Try 250 free problems


