Tools-in-Action30min
Listen to Production the Way It Deserves
This talk reimagines software reliability through agentic AI. Moving beyond static alerts, AI agents can interpret production signals, detect weak issues early, and assist incident analysis. Drawing on SRE experience, it examines observability bottlenecks, the value and limits of AI in reliability, and the evolving partnership between humans and intelligent systems.
Poone Mokariewake
talkDetail.whenAndWhere
Wednesday, April 22, 17:00-17:30
TBA 11
talks.roomOccupancytalks.noOccupancyInfo
For years, the way we monitor and alert on production systems has remained reactive. We define assumptions, set thresholds, and wait for alerts to fire, often too late, too early or too often.As systems grow, this model creates noise, operational toil, and burnout, while still failing to capture what really matters in production.
In this talk, we will explore how agentic AI can change the way we approach software reliability. Instead of relying solely on predefined rules and static alerts, AI agents can continuously observe production signals, reason about system behavior, and help engineers detect and understand issues earlier.
Drawing from real-world SRE experience, we will look at the current bottlenecks of observability and alerting, and how AI agents can assist in identifying problems, surfacing weak signals, and supporting incident analysis. We will also discuss the strengths and limitations of AI agents in reliability systems: where they add real value, where they fall short, and why human expertise remains essential.
Finally, we will explore what the future of reliability could look like when engineers shift from reacting to alerts to truly listening to production
In this talk, we will explore how agentic AI can change the way we approach software reliability. Instead of relying solely on predefined rules and static alerts, AI agents can continuously observe production signals, reason about system behavior, and help engineers detect and understand issues earlier.
Drawing from real-world SRE experience, we will look at the current bottlenecks of observability and alerting, and how AI agents can assist in identifying problems, surfacing weak signals, and supporting incident analysis. We will also discuss the strengths and limitations of AI agents in reliability systems: where they add real value, where they fall short, and why human expertise remains essential.
Finally, we will explore what the future of reliability could look like when engineers shift from reacting to alerts to truly listening to production
Poone Mokari
Co-founder and CEO of ewake.ai, Poone Mokari builds AI agents focused on software reliability. Before founding ewake.ai, she worked as a Site Reliability Engineer at companies such as Criteo, where she gained hands-on experience operating and scaling large production systems, and on-call duties.
With a background rooted in SRE and DevOps, she is particularly interested in how AI agents can empower engineering teams reduce operational load and improve reliability at scale.
talkDetail.shareFeedback
talkDetail.feedbackNotYetAvailable
talkDetail.feedbackAvailableAfterStart
talkDetail.signInRequired
talkDetail.signInToFeedbackDescription
occupancy.title
occupancy.votingNotYetAvailable
occupancy.votingAvailableBeforeStart
talkDetail.signInRequired
occupancy.signInToVoteDescription
comments.speakerNotEnabledComments