Tim Piemonte, Tribeca Softech & Ankur Patel, Multimodal
In this theCUBE + NYSE Wired: Mixture of Experts interview, Robert Brooks IV, founding team member at Lambda, joins theCUBE’s John Furrier to unpack the realities of scaling AI infrastructure as enterprise demand surges. Brooks details Lambda’s $480M Series D equity round (taking total funding to “over $800M”), participation from investors including NVIDIA and why capital intensity, power density and liquid cooling (50–150 kW per rack) are redefining data center strategy. He shares how Lambda abstracts DevOps for math-first ML teams with a plug-and-play stack, one-click clusters that spin up hundreds of GPUs on NVIDIA InfiniBand, and an inference API with no rate limiting – enabling POCs that seamlessly graduate to production. The conversation ties tech execution to financial outcomes: from securing megawatts and supply chain to early access on Blackwell (a B200 test cluster planned around GTC) so customers can move faster with predictable economics. The discussion also explores market-shaping enterprise strategies at the intersection of tech and finance: DeepSeek’s “test-time compute” moment, open-source momentum (and why transparency and controllability matter) and the shifting cost curve – billions to 10x training vs. ~13 cents more per token for reasoning at inference. Brooks explains how Lambda’s “platform engineering for ML” meets teams where they build – managed Kubernetes/Slurm, full lifecycle from training to inference and developer-controlled scale across thousands of GPUs. Real-world signals include material-science ML wins, enterprises hosting open-source models on Lambda’s inference API and hands-on R&D from a video model leaderboard to a humanoid robot. He closes on focus as strategy – saying “no” to non-AI workloads to move faster for one customer profile – and the five-minute, credit-card path to get started.