Russ d'Sa, LiveKit
This conversation explores voice and multimodal artificial intelligence (AI) infrastructure for agentic applications, along with the technical and product factors to consider when building conversational agents that can perceive and act. The discussion highlights open-source streaming infrastructure, real-world enterprise deployments and safety approaches for non-deterministic interactions.
Russ d'Sa is the chief executive officer and co-founder of LiveKit. D'Sa describes LiveKit’s open-source streaming platform and how it enables multimodal AI agents that can see, hear and speak. Gemma Allen of theCUBE hosts the interview, which theCUBE Research produces for NYSE Wired at the AI Agent Conference 2026.
D'Sa traces LiveKit’s origins during the COVID-19 pandemic and explains the company’s role in scaling ChatGPT Voice, its rapid commercial growth and the technical challenges of building agentic applications. Key takeaways include D'Sa’s emphasis on simulation-based testing to ensure safety and reliability in conversations driven by large language model (LLM) systems. D'Sa highlights a market bifurcation between novel consumer assistants and goal-oriented enterprise voice AI. He argues that platform tooling and the entire development lifecycle (development, testing, deployment, scaling and observation) must be redesigned to make voice AI applications as easy to build as web applications.
Relevant topics covered: voice AI, multimodal AI, streaming platform, open-source infrastructure, agentic applications, AI infrastructure, simulation-based testing, safety, enterprise voice AI, ChatGPT Voice, large language model systems.