• Episode 11: Sylvain Kalache
    Apr 8 2026
    AI agents are triaging incidents and writing runbooks, but are LLMs actually the right tool for operational work? Sylvain Kalache, Head of AI Labs at Rootly, shares research on where AI SRE tools add real value, where they fall apart, and what it means for operational maturity when humans only see the hardest problems. Guest: Sylvain Kalache, Head of AI Labs at Rootly (https://rootly.com). Show Notes Available at https://podcast.certomodo.io/sylvain-kalache.html.
    54 mins
  • Episode 10: Kyle Forster
    Mar 10 2026
    Explores the 'AI code tsunami' and how massive, AI-generated code changes are forcing engineering teams to rethink traditional code reviews, observability, and the future of SRE roles. The conversation highlights a shift toward treating test environments like production and using narrowly scoped AI agents to manage system reliability, guided by simplified, binary SLIs and SLOs. Guest: Kyle Forster, founder and CEO of RunWhen (https://runwhen.com). Show Notes Available at https://podcast.certomodo.io/kyle-forster.html.
    52 mins
  • Episode 9: Jon Reeve
    Dec 14 2025
    Discusses the 'complexity cult' of the current observability industry, how the open-source TUI tool Gonzo can reveal infrastructure insights through a novel use of LLMs for sentiment analysis, and the vision of more accessible observability experiences for software engineers. Guest: Jon Reeve, founder and CPO of ControlTheory (controltheory.com). Show Notes Available at https://podcast.certomodo.io/jon-reeve.html.
    42 mins
  • Episode 8: Aaron 'Checo' Pacheco
    Oct 29 2025
    Explores the evolution of monitoring and observability with Aaron Pacheco from Ottermon.ai, examining how observability costs now consume 15-25% of infrastructure budgets. Show Notes Available at https://podcast.certomodo.io/aaron-pacheco.html.
    53 mins
  • Episode 7: Sebastian Vietz
    Sep 7 2025
    Discusses with Sebastian Vietz from Compass Digital how naming conventions shape industry perceptions, with a focus on AI SRE terminology. Show Notes Available at https://podcast.certomodo.io/sebastian-vietz.html.
    1 hr and 6 mins
  • Episode 6: Chris Evans
    Jul 13 2025
    Explores with Chris Evans from Incident.io whether AI-driven automation actually reduces toil or just shifts it elsewhere. Show Notes Available at https://podcast.certomodo.io/chris-evans.html.
    51 mins
  • Episode 5: Derek Brown
    Mar 31 2025
    Compares infrastructure management at large tech companies versus smaller organizations with Derek Brown from Plaid. Show Notes Available at https://podcast.certomodo.io/derek-brown.html.
    55 mins
  • Episode 4: Kat Gaines
    Mar 14 2025
    Examines incident management beyond technical fixes, emphasizing communication and customer experience with Kat Gaines from PagerDuty. Show Notes Available at https://podcast.certomodo.io/kat-gaines.html.
    59 mins