Google Cloud Unveils Major AI Hypercomputer Software Upgrades

Google Cloud announced significant software upgrades to its AI Hypercomputer, dramatically improving AI model training and inference efficiency. Pathways on Cloud, a distributed runtime, is now available on Google Cloud, enabling elastic training and high-throughput inference. Cluster Director adds Slurm support and 360° observability features for high performance and reliability. GKE integrates Inference Gateway and Inference Quickstart, slashing inference costs and boosting throughput. vLLM now supports TPUs, further accelerating inference. Dynamic Workload Scheduler expands accelerator support, optimizing resource utilization. These upgrades empower developers to build and deploy AI applications faster and more cost-effectively.