What does it really take to run generative AI at scale? In this Google Cloud Partner AI Series episode, theCUBE Research’s Savannah Peterson sits down with Poonam Lamba, senior product manager of GKE AI inference and stateful workloads, Google Cloud, at Google, and Eddie Villalba, outbound product manager at Google Cloud, to unpack how Kubernetes — specifically GKE — is evolving to support enterprise AI inference with real-world impact.
Lamba shares how Google is meeting developers where they are, with tools such as the GKE Inference Gateway and custom compute classes. Eddie Villalba adds his perspective on how AI is “just another workload” — but with some important twists.
From dynamic scheduling to stateful services and network-aware storage, the discussion makes it clear: Kubernetes isn’t just powering the web anymore — it’s the foundation for AI at scale. Whether you’re deep into DevOps or exploring agentic AI, this episode offers a grounded look at what’s next for containerized intelligence.
Forgot Password
Almost there!
We just sent you a verification email. Please verify your account to gain access to
Google Cloud: Passport to Containers. If you don’t think you received an email check your
spam folder.
In order to sign in, enter the email address you used to registered for the event. Once completed, you will receive an email with a verification link. Open this link to automatically sign into the site.
Register For Google Cloud: Passport to Containers
Please fill out the information below. You will recieve an email with a verification link confirming your registration. Click the link to automatically sign into the site.
You’re almost there!
We just sent you a verification email. Please click the verification button in the email. Once your email address is verified, you will have full access to all event content for Google Cloud: Passport to Containers.
I want my badge and interests to be visible to all attendees.
Checking this box will display your presense on the attendees list, view your profile and allow other attendees to contact you via 1-1 chat. Read the Privacy Policy. At any time, you can choose to disable this preference.
Select your Interests!
add
Upload your photo
Uploading..
OR
Connect via Twitter
Connect via Linkedin
EDIT PASSWORD
Share
Forgot Password
Almost there!
We just sent you a verification email. Please verify your account to gain access to
Google Cloud: Passport to Containers. If you don’t think you received an email check your
spam folder.
In order to sign in, enter the email address you used to registered for the event. Once completed, you will receive an email with a verification link. Open this link to automatically sign into the site.
Sign in to gain access to Google Cloud: Passport to Containers
Please sign in with LinkedIn to continue to Google Cloud: Passport to Containers. Signing in with LinkedIn ensures a professional environment.
Are you sure you want to remove access rights for this user?
Details
Manage Access
email address
Community Invitation
Just Another Container: Demystifying Gen AI Inference on GKE | Google Cloud Passport to Containers
What does it really take to run generative AI at scale? In this Google Cloud Partner AI Series episode, theCUBE Research’s Savannah Peterson sits down with Poonam Lamba, senior product manager of GKE AI inference and stateful workloads, Google Cloud, at Google, and Eddie Villalba, outbound product manager at Google Cloud, to unpack how Kubernetes — specifically GKE — is evolving to support enterprise AI inference with real-world impact.
Lamba shares how Google is meeting developers where they are, with tools such as the GKE Inference Gateway and custom compute classes. Eddie Villalba adds his perspective on how AI is “just another workload” — but with some important twists.
From dynamic scheduling to stateful services and network-aware storage, the discussion makes it clear: Kubernetes isn’t just powering the web anymore — it’s the foundation for AI at scale. Whether you’re deep into DevOps or exploring agentic AI, this episode offers a grounded look at what’s next for containerized intelligence.
Just Another Container: Demystifying Gen AI Inference on GKE | Google Cloud Passport to Containers
What does it really take to run generative AI at scale? In this Google Cloud Partner AI Series episode, theCUBE Research’s Savannah Peterson sits down with Poonam Lamba, senior product manager of GKE AI inference and stateful workloads, Google Cloud, at Google, and Eddie Villalba, outbound product manager at Google Cloud, to unpack how Kubernetes — specifically GKE — is evolving to support enterprise AI inference with real-world impact.
Lamba shares how Google is meeting developers where they are, with tools such as the GKE Inference Gateway and custom...Read more
Savannah Peterson
Principal Analyst & HostSiliconANGLE Media, Inc.
HOST
Poonam Lamba
Senior Product Manager, GKE AI Inference & Stateful WorkloadsGoogle Cloud
Eddie Villalba
Outbound Product ManagerGoogle
What does it really take to run generative AI at scale? In this Google Cloud Partner AI Series episode, theCUBE Research’s Savannah Peterson sits down with Poonam Lamba, senior product manager of GKE AI inference and stateful workloads, Google Cloud, at Google, and Eddie Villalba, outbound product manager at Google Cloud, to unpack how Kubernetes — specifically GKE — is evolving to support enterprise AI inference with real-world impact.
Lamba shares how Google is meeting developers where they are, with tools such as the GKE Inference Gateway and cust...Read more