Sid Sheth, Founder, d-Matrix
In this conversation at theCUBE + NYSE Wired: AI Factories – Data Centers of the Future, theCUBE’s John Furrier sits down with Vipul Prakash, co-founder and CEO of Together AI, to explore how AI factories are redefining enterprise infrastructure. Prakash details the breakneck growth of AI-native applications – where usage that once took SaaS nine months to double is now happening in nine days – and why this demand is forcing a rethink of compute, storage and networking. He explains how leading apps segment traffic across closed APIs and fine-tuned open-source models, and why efficiency, scale and low latency make AI factories the new unit of value in the enterprise stack. The discussion highlights the critical role of data adjacency (fabric-connected, parallel storage placed next to models), continuous testing for gen AI (from entropy checks to A/B rollouts and ECC/network reliability), and the organizational advantage of appointing a chief AI officer to cut through policy and security inertia. Prakash also shares how Together AI is addressing real-world constraints, most notably power, and reveals progress building new AI factories in Maryland and Memphis, with more on the way. The interview further dives into AI-scale architecture trends (transformer evolution, sharded attention/experts) and enterprise use cases, from digital-native builders to regulated sectors. Prakash outlines Together AI’s end-to-end model: from energized data centers and GPU-dense infrastructure to developer experience, sovereignty and security, aimed at producing more tokens from the same footprint. He closes with a look at unifying training and inference on shared infrastructure to smooth peak/average loads and improve economics, plus customer momentum ranging from Zoom to VFS (visa processing with 156 governments). It’s a pragmatic blueprint for how AI factories are becoming the cornerstone of digital strategy – and the most profound infrastructure shift since the dawn of the internet.