Philip Kiely

Lead Developer Advocate
Baseten

Presentation Title:

Real-Time, Real Problems:
Scaling AI in the Wild

Presentation Summary:

Real-time generative AI applications promise transformative user experiences—but scaling inference in production is a constant battle. Demand spikes, infrastructure meltdowns, and unpredictable model behaviors lurk behind every release.

Drawing from direct experience deploying mission-critical AI workloads at startups and large enterprises, this talk covers the four essential components of a production-ready inference platform: applied model performance research, distributed multi-cloud infrastructure, model management and observability tooling, and AI engineering expertise. I’ll highlight common pitfalls and share concrete advice on how to build robust, resilient systems that thrive under real-world demand.

About Philip Kiely

Philip Kiely leads Developer Relations at Baseten.

Prior to joining Baseten in 2022, he worked across software engineering and technical writing for a variety of startups.

Outside of work, you'll find Philip practicing martial arts, reading a new book, or cheering for his adopted Bay Area sports teams.