Enterprises Adopt Cost-Effective RAG And Fine-Tuning

In late 2025, independent analyst Melissa argues most enterprises will use rather than build frontier models, favoring LoRA fine-tuning and retrieval-augmented generation (RAG) for customization and currency. She quantifies compute: frontier training requires 60,000–150,000 GPUs and $30M–$150M+, while fine-tuning runs on 4–24 GPUs for $300–$8,000. Enterprises commonly develop on neoclouds then migrate proven workloads on-premise.
Key Points
- 1Training requires 60,000–150,000 GPUs and $30M–$150M+, restricting model creation to a few labs.
- 2LoRA fine-tuning uses tiny adapters on 4–24 GPUs, costing $300–$8,000, enabling practical enterprise customization.
- 3Stack RAG with LoRA to keep facts current while preserving tone, runnable on small racks or neoclouds.
Scoring Rationale
Actionable enterprise guidance with concrete cost and compute figures, limited by analyst synthesis rather than primary research.
Sources
Public references used for this report.
Practice with real FinTech & Trading data
90 SQL & Python problems · 15 industry datasets
250 free problems · No credit card
See all FinTech & Trading problems
