Google and NVIDIA Collaborate to Slash AI Inference Costs with Advanced Infrastructure
At Google Cloud Next, Google and NVIDIA unveiled a new hardware and software architecture aimed at cutting the cost and raising the efficiency of AI inference at scale, promising up to ten times lower cost per token and up to ten times higher token throughput.
