Vast.ai Launches Serverless GPU Optimization Platform

Vast.ai today announced Vast Serverless, a serverless orchestration layer that automatically benchmarks and optimizes GPUs, predicts load across heterogeneous auto-groups, and routes workloads in real time. The platform draws on Vast.ai's marketplace of 17,000 GPUs from 1,300 providers, used by roughly 123,000 customers. It promises continuous performance-per-dollar optimization, predictive scaling, SOC 2 security options, and developer debugging access for training and inference workloads.
Key Points
- Launches serverless orchestration that auto-benchmarks and routes workloads across 17,000 GPUs
- Reduces cost by continuously optimizing performance-per-dollar using predictive scaling and heterogeneous GPU selection
- Enables AI teams to scale training and inference more cheaply, with developer debugging access and enterprise security options
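The performance-per-dollar idea in the points above can be illustrated with a minimal sketch. This is not Vast.ai's API or routing algorithm; the `GpuOffer` type, the `pick_offer` function, and all figures are hypothetical, and the sketch simply assumes each GPU has a benchmarked throughput and an hourly price.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GpuOffer:
    """Hypothetical marketplace offer; fields are illustrative assumptions."""
    name: str
    throughput: float      # benchmarked work units per second
    price_per_hour: float  # USD per hour


def perf_per_dollar(offer: GpuOffer) -> float:
    """Throughput delivered per dollar of hourly spend."""
    return offer.throughput / offer.price_per_hour


def pick_offer(offers: list[GpuOffer], min_throughput: float = 0.0) -> GpuOffer:
    """Return the offer with the best performance-per-dollar that still
    meets a minimum throughput floor."""
    eligible = [
        o for o in offers
        if o.throughput >= min_throughput and o.price_per_hour > 0
    ]
    if not eligible:
        raise ValueError("no offer meets the throughput floor")
    return max(eligible, key=perf_per_dollar)


# Made-up example data, not real marketplace prices.
offers = [
    GpuOffer("gpu-a", throughput=100.0, price_per_hour=2.0),
    GpuOffer("gpu-b", throughput=60.0, price_per_hour=1.0),
    GpuOffer("gpu-c", throughput=30.0, price_per_hour=0.4),
]

best_overall = pick_offer(offers)
best_fast_enough = pick_offer(offers, min_throughput=50.0)
```

In this toy data the cheapest card wins on raw performance-per-dollar, while adding a throughput floor shifts the choice to a faster card. A real router would also have to weigh availability, latency, and predicted load.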
Scoring Rationale
Practical serverless GPU orchestration with high usability and official release, but limited by vendor-specific marketplace reach.