Portugal Launches Amalia Open Source Portuguese Language Model
Portugal officially unveiled Amalia on July 1, 2026, becoming the first open large language model built specifically for European Portuguese, developed by a university consortium led by NOVA University Lisbon and funded with EUR 7 million through the country's Recovery and Resilience Plan. Rather than training from scratch, more than sixty researchers adapted the open EuroLLM-9B base model and the earlier GlorIA model, expanding its context window to 32,000 tokens and releasing a roughly 9-billion-parameter text model plus vision and speech components under the Apache 2.0 license on Hugging Face. For teams evaluating national AI-sovereignty strategies, Amalia is a concrete, lower-cost template: adapting an existing open model rather than pretraining end-to-end kept the whole project at research-grant scale, a pattern also used by Spain's ALIA and Germany's Teuken-7B.
Amalia's real significance isn't a benchmark score, it's a budget line: a country of roughly ten million people delivered a sovereign, open LLM adapted to its own language for about the cost of a single university research grant rather than a hyperscaler contract. That reframes "AI sovereignty" for smaller nations and public-sector teams as an adaptation problem, not a from-scratch pretraining problem.
What happened
Portugal's government unveiled Amalia on July 1, 2026 at Instituto Superior Tecnico's innovation center in Lisbon, with Prime Minister Luis Montenegro and ministers Fernando Alexandre and Goncalo Matias in attendance. The government describes Amalia, an acronym for Assistente Multimodal Automatico de Linguagem com Inteligencia Artificial, as the first open LLM built for European Portuguese. A consortium of more than sixty researchers, coordinated by NOVA University Lisbon alongside Instituto Superior Tecnico and the universities of Coimbra, Porto, and Minho, built it with Foundation for Science and Technology support, funded through Portugal's Recovery and Resilience Plan at EUR 5.5 million initially plus a further EUR 1.5 million earmarked through 2027. Alongside the launch, the government introduced IA.GOV.PT, the official AI portal for Portugal's public administration. "This is an important step in positioning Portugal in artificial intelligence, also showcasing the talent and knowledge of academia in service of society and the country," said Joao Magalhaes, a NOVA LINCS researcher and project coordinator, according to NOVA FCT's official announcement.
Technical context
Rather than pretraining from scratch, the team extended the pretraining of EuroLLM-9B, an open European multilingual model, and drew on the earlier Portuguese GlorIA model, expanding the context window to 32,000 tokens. Per the team's own technical report, accepted at PROPOR 2026, the released roughly 9-billion-parameter text model, alongside a vision model and speech-recognition component, is published under the Apache 2.0 license on Hugging Face under the amalia-llm organization; the paper reports Amalia matches strong baselines on translated benchmarks while substantially improving on European-Portuguese-specific evaluations versus machine-translated alternatives.
Industry context
Amalia extends a pattern already used elsewhere in Europe: the Basque Country's Latxa adapted Llama 2 for Euskara, Spain's ALIA trained a 40-billion-parameter model at the Barcelona Supercomputing Center, and Germany's Teuken-7B came out of the public OpenGPT-X consortium for roughly EUR 14 million. By contrast, France, home to Mistral AI (valued near EUR 11.7 billion in its September 2025 Series C), has no publicly funded national language model; its closest tool, Albert, aggregates third-party open models rather than training one. The EU-wide OpenEuroLLM project, launched February 2025 with about twenty member organizations and access to EuroHPC supercomputers, aims to formalize this adaptation-based approach across all official EU languages.
For practitioners
The reusable part of Portugal's approach is the adaptation pipeline, not the specific model: start from an open multilingual base like EuroLLM, extend pretraining and context length with targeted local-language data, and layer in modality-specific components, rather than funding a full pretraining run. That pipeline is what put a national LLM within reach of a university consortium and a research-grade budget instead of the tens-to-hundreds-of-millions typical of frontier training runs.
What to watch
Whether Amalia sees real adoption in the public-service, healthcare, and education use cases the government has flagged; whether the OpenEuroLLM project's shared-compute model prompts other EU member states without a national LLM, starting with France, to follow the same adaptation playbook; and how Amalia's pt-PT-specific benchmark results, per the team's PROPOR 2026 paper, hold up against future frontier multilingual models.
Key Points
- 1Portugal launched Amalia on July 1, 2026, its first open large language model built for European Portuguese, for about 7 million euros.
- 2A university consortium adapted the open EuroLLM-9B base model rather than pretraining from scratch, keeping the project at research-grant scale.
- 3The launch offers a repeatable EU playbook for sovereign national language models, contrasting with France, which still lacks a public one.
Scoring Rationale
Amalia demonstrates a low-cost, adaptation-based path to a sovereign national LLM that other mid-size EU countries can replicate using shared EuroHPC compute, with visible high-level backing (the PM and two ministers attended the launch). Its technical scope, adapting an open 9-billion-parameter base rather than pretraining from scratch, is modest next to frontier releases, keeping its impact concentrated in AI-sovereignty and open-source policy circles rather than the broader model-capability conversation.
Sources
Public references used for this report.
Practice interview problems based on real data
1,625 SQL & Python problems across 15 industry datasets — the exact type of data you work with.
Try 250 free problems
