Infrastructureai infrastructureopen sourcefundingtogether ai

Together AI Raises $800M to Scale Open Models

||By LDS Team
7.4
Relevance Score
Together AI Raises $800M to Scale Open Models
Photo: pymnts.com · rights & takedowns

Together AI raised an $800 million Series C on July 1, 2026 at an $8.3 billion post-money valuation, led by Aramco Ventures with NVIDIA, Vista Equity Partners, General Catalyst, Emergence Capital and others joining. The round underscores how enterprises running production AI workloads are turning to open-weight models like DeepSeek, Nemotron, MiniMax and Kimi to escape the margin-eating cost of closed frontier APIs. According to the company, annual bookings crossed $1.15 billion last quarter, it now serves thousands of paying customers including Cursor, Cognition and Decagon, and it has secured commitments for over 500 megawatts of compute capacity from investors to fund roughly 50-fold capacity growth over five years. CEO Vipul Ved Prakash said the goal is to make "intelligence... abundant, not expensive."

The size of this round, one of 2026's largest AI infrastructure raises, is itself a signal: late-stage investors are now pricing open-weight inference platforms as core production infrastructure rather than a cost-saving side option. For teams running LLM workloads at scale, that means more capital chasing the same problem practitioners already feel acutely, unpredictable and compounding inference bills, and a stronger case for evaluating open-model platforms as a primary rather than backup path.

What happened

Together AI announced an $800 million Series C at an $8.3 billion post-money valuation, according to the company's own press release. Aramco Ventures led the round, with Vista Equity Partners, General Catalyst, Emergence Capital, NVIDIA, March Capital, Pegatron, Schneider Electric's SE Ventures, Salesforce Ventures, DTCP Growth, Lux Capital, Geodesic and PSP Partners also participating. In its own blog post, CEO and co-founder Vipul Ved Prakash said the company secured commitments for over 500 megawatts of compute capacity, to be capitalized independently by investors, and that Together AI expects its capacity and infrastructure footprint to grow roughly 50-fold over the next five years. The company said annual bookings crossed $1.15 billion last quarter and that it now serves thousands of paying customers, including Cursor, Cognition, Decagon, Eleven Labs and Suno. Prakash said in the release: "Our mission is to ensure that intelligence is abundant, not expensive. The future of AI won't be owned by a few companies."

Technical context

Together AI positions itself as a full-stack platform, models plus kernels, compilers, inference systems and training infrastructure, for open-weight models such as DeepSeek, Nemotron, MiniMax, Kimi and GLM. The company points to research output including FlashAttention-4 for NVIDIA Blackwell and its Together Megakernel and together.compile tooling as the basis for its performance and cost claims. On customer savings, the company's own materials are not fully consistent: its blog post cites customers achieving "6x to 20x" lower costs versus closed models, while its press release states a wider "6x to 60x" range; both cite Decagon's sixfold reduction as a concrete example. Treat the upper end of that range as a company-reported outlier rather than a typical result.

For practitioners

Teams evaluating a move to an open-model inference vendor should budget for the real migration costs, observability, latency-tail management, and cost accounting across multi-tenant GPU fleets, rather than assuming a drop-in swap for a closed-model API. The specific savings multiple a given workload sees will depend heavily on model size, batching, and latency SLAs, so company-cited ranges should be treated as best-case anchors, not defaults.

What to watch

Whether Together AI's 500 MW compute buildout and 50-fold capacity target materialize on the stated five-year timeline, and whether investor-capitalized capacity (rather than direct balance-sheet spend) changes the unit economics it can pass on to customers. Also watch whether independent, standardized benchmarks emerge to validate the company's cost and performance claims, since current figures are customer-reported and company-published rather than third-party verified.

Key Points

  • 1Together AI's $800M Series C at an $8.3B valuation signals investors now treat open-model inference infrastructure as core production infrastructure, not a cost hack.
  • 2The company secured 500 MW of investor-backed compute commitments to fund a targeted 50-fold capacity expansion, betting resources will keep pace with open-model demand.
  • 3Customer savings claims (6x-20x per the company blog, 6x-60x per its press release) are internally inconsistent and company-reported, so practitioners should treat them cautiously.

Scoring Rationale

An $800M Series C at an $8.3B valuation is one of 2026's larger AI infrastructure raises and signals strong institutional confidence in open-weight model economics as production infrastructure. It is notable for ML/infra teams tracking inference cost and vendor consolidation but is a funding/business event rather than a technical breakthrough, so it sits just below the major-impact tier.

Sources

Public references used for this report.

5 sources

Practice interview problems based on real data

1,625 SQL & Python problems across 15 industry datasets — the exact type of data you work with.

Try 250 free problems