John Mao, VAST Data | CES 2026
In this interview from CES 2026, John Mao, vice president of global strategic alliances at VAST Data, joins theCUBE’s Rob Strechay to unpack VAST's pivotal role in NVIDIA’s Vera Rubin system announcement. The discussion centers on the reinvention of the AI stack, specifically the evolution of KV cache storage to support larger models and longer reasoning capabilities. Mao explains how VAST is moving beyond the limitations of local high-bandwidth memory by utilizing NVIDIA’s BlueField-4 DPUs and Spectrum-X networking to create an infinitely scalable pool of NVMe storage. This architecture enables context memory to extend across the network with high bandwidth and low latency via RDMA, fundamentally changing how data feeds the GPU. The conversation also explores the broader implications of these infrastructure advancements for the "AI Everywhere" era, bridging the gap between data center innovation and consumer applications. Mao highlights how this shared-everything architecture impacts industries ranging from sports and media entertainment to robotics and physical AI, allowing for the democratization of unstructured data analysis. Additionally, they touch upon the manufacturing and packaging simplifications of the new supercomputing generation, underscoring how these developments are accelerating enterprise adoption of AI in production environments.