Another Giant Leap: The Rubin CPX Specialized Accelerator & Rack

Dylan Patel·SemiAnalysis·AI·September 10, 2025

Nvidia announced the Rubin CPX, a solution that is specifically designed to be optimized for the prefill phase, with the single-die Rubin CPX heavily emphasizing compute FLOPS over memory bandwidth. This is a game changer for inference, and its significance is surpassed only by the March 2024 announcement of the GB200 NVL72 Oberon rack-scale form factor. Only with hardware specialized to the very different phases of inference, prefill and decode, can disaggregated serving achieve its full potent...

Read full article →

Another Giant Leap: The Rubin CPX Specialized Accelerator & Rack

Related Articles