Menu

1 Million Tokens Per Second on Kubernetes

1 Million Tokens Per Second on Kubernetes

Jul 28, 2026

Host:

Bart Farrell

Guest:

Federico Iezzi

Kubernetes Stories from the Trenches book

Kubernetes Stories from the Trenches

A book of battle-tested experiences from engineers who pushed Kubernetes to its limits and lived to tell the tale. Download

More Episodes

The Hidden Cost of Slow Autoscaling
with John Ford
The Namespaces Scaling Trap
with Brian Stack
AI Agents Running Kubernetes
with Mike Solomon
SaaS with Kubernetes Operators and Garbage Collection
with Alexander Held
What Hip-Hop Can Teach Us About Kubernetes
with Kelsey Hightower, Eric Abercrombie, and Julius Payne II

Interviews

Observability Before Kubernetes Changes
with Mesut Oezdil
Production Kubernetes With Claude
with Alex Burnett
Guarding Kubernetes Production Changes
with Max Majander
ToolHive and MCP on Kubernetes
with Juan Antonio Osorio
Fail Forward in Kubernetes
with Konrad Eriksson

Announcements

CloudBolt announces The Kubernetes Automation Trust Gap Study
with Yasmin Rajabi
Tintri announces OpenTelemetry Workload Visibility
with Phil Trickovic
StormForge Announces In-Place Pod Resizing and Cost Allocation GA
with Yasmin Rajabi
Google Cloud Donates llm-d, TPU Drivers, and More to CNCF
with Abdel Sghiouar
Spectro Cloud Announces Hadron Linux: A Minimal OS for Kubernetes
with Ettore Di Giacinto

Subscribe to KubeFM Weekly

Get the latest Kubernetes videos delivered to your inbox every week.

or subscribe via