Updated 6 days ago · Refreshed hourly

business

The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering

Hosted by Fexingo · 🇺🇸 US · EN · 117 episodes

Apple Podcasts Website

Where this show ranks

Audience Score

Niche

Episodes

117

Last ep.

today

Avg length

10m

Language

Booking Probability™

Stretch.
Sign in to score against your profile.

Estimated audience

Audience size not yet estimated

Listen Score

Niche reach.

Virality (30d)

Steady cadence.

Pitch Analysis

Sign in to see how your Guest Score compares to this show's Required Pod Score and get a Stretch / Match-fit / Anchor verdict.

Required Pod Score

80/ 100

Premium

Established thought leaders with verified media credentials.

Contact path

Verified email on file

Unlock verified contacts

Guest openness

Not signalled recently

Best topics to pitch

business

What is The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering?

Lucas and Luna cut through the noise around site reliability engineering to examine how real-world SRE teams balance uptime, incident response, and production change. Each episode takes a single concept — error budgets, toil automation, postmortem culture, capacity planning — and grounds it in a specific case: how a major streaming service reduced paging noise, how a payments platform rebuilt its incident command structure, or how a cloud provider manages multi-region failover. Lucas brings the numbers — latency percentiles, MTTR trends, SLO burn rates — while Luna pushes on the human and organizational trade-offs: What does a junior SRE need to know about on-call? How do you measure reliability without crushing innovation? Why do some blameless postmortems actually work? Together they treat SRE not as a certification topic but as a living practice, citing real outages, open-source tools, and engineering blogs. This show is for engineers, ops leads, and platform teams who already know

business

About the host

Fexingo hosts The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering, a business show with 117 episodes published.

Verified host email:████████@████.comSign in to unlock →

Recent episodes

Our AI reads these to draft pitches

How SRE Teams Use Blameless Postmortems to Improve Reliability

Jul 24, 20269mEp. 129S3

Episode 129 of The Site Reliability Podcast explores how SRE teams use blameless postmortems to drive systemic improvements without creating a culture of fear. Lucas and Luna dive into the specific practices at a major t

How SRE Teams Use Runbooks to Reduce Incident Response Time

Jul 23, 202613mEp. 128S3

Episode 128 of The Site Reliability Podcast dives into the unsung hero of incident response: runbooks. Lucas and Luna break down how GitLab reduced their mean time to mitigate by 40% by replacing tribal knowledge with st

How SRE Teams Use Cost-to-Serve Analysis to Optimize Infrastructure

Jul 23, 202611mEp. 127S3

In this episode of The Site Reliability Podcast, Lucas and Luna explore how SRE teams are applying cost-to-serve analysis to optimize cloud infrastructure spending. Using a real case from a mid-size fintech company that

How SRE Teams Use Rolling Backups to Recover From Ransomware

Jul 22, 20268mEp. 126S3

Lucas and Luna explore how site reliability teams are reframing backup strategy in an era of ransomware and silent data corruption. Lucas breaks down the 3-2-1-1 rule — three copies, two media types, one offsite, one imm

How SRE Teams Use Synthetic Monitoring to Catch Outages Before Users Do

Jul 22, 20269mEp. 125S3

In this episode of The Site Reliability Podcast with Fexingo, Lucas and Luna dive into synthetic monitoring — a proactive approach where SRE teams simulate user traffic to detect issues before they impact real customers.

How SRE Teams Use Capacity Planning to Prevent Outages Before They Happen

Jul 21, 20268mEp. 124S3

Lucas and Luna dive into the science of capacity planning for site reliability engineering. They break down how Netflix uses predictive modeling to scale infrastructure ahead of demand spikes, avoiding the kind of cascad

How SRE Teams Use Feature Flags to Control Risk in Production

Jul 21, 202610mEp. 123S3

Lucas and Luna explore how feature flags give SRE teams a kill switch for new code, reducing blast radius and enabling canary-style testing without full redeploys. They break down the case of a major e-commerce platform

How SRE Teams Use Graceful Degradation to Keep Services Running

Jul 20, 20267mEp. 122S3

In this episode of The Site Reliability Podcast, Lucas and Luna explore how SRE teams design systems to degrade gracefully rather than fail catastrophically. They examine Netflix's Hystrix library, which introduced circu

How SRE Teams Use Canary Deployments to Reduce Blast Radius

Jul 19, 202611mEp. 121S3

In this episode of The Site Reliability Podcast, Lucas and Luna explore how canary deployments help SRE teams catch issues early and limit impact. They break down the real-world mechanics behind gradual rollouts—how Netf

How SRE Teams Use Traffic Shadowing to Test in Production

Jul 19, 202610mEp. 120S3

In episode 120 of The Site Reliability Podcast with Fexingo, Lucas and Luna explore traffic shadowing—a technique that lets SRE teams test new code against live production traffic without affecting real users. They break

How SRE Teams Use Observability Signals to Diagnose Production Issues

Jul 18, 20267mEp. 119S3

In this episode of The Site Reliability Podcast, Lucas and Luna explore how SRE teams leverage observability signals — logs, metrics, and traces — to diagnose production issues faster. They walk through a real example fr

How SRE Teams Use Load Shedding to Protect Critical Services

Jul 16, 202612mEp. 115S3

Site reliability engineers have a last-resort play when traffic overwhelms their systems: load shedding. In this episode, Lucas and Luna explore how companies like Google and Amazon use intentional, tiered request-droppi

How SRE Teams Use Toil Budgets to Automate the Right Things

Jul 16, 202610mEp. 114S3

Episode 114 of The Site Reliability Podcast explores a concept that separates mature SRE teams from the rest: the toil budget. Lucas and Luna break down how teams at companies like Google and Shopify explicitly cap the a

How SRE Teams Use Fault Trees to Root Out Latent Defects

Jul 15, 20268mEp. 113S3

In Episode 113 of The Site Reliability Podcast, Lucas and Luna explore how fault tree analysis — a technique borrowed from aerospace and nuclear engineering — is being adapted by SRE teams to uncover latent defects befor

How SRE Teams Use Game Days to Build Incident Muscle Memory

Jul 15, 20268mEp. 112S3

In this episode of The Site Reliability Podcast with Fexingo, Lucas and Luna dive into the practice of Game Days—simulated failure exercises that help SRE teams build muscle memory without risking production. They walk t

How SRE Teams Use Error Budgets to Balance Reliability and Velocity

Jul 14, 202610mEp. 111S3

Lucas and Luna dive into error budgets, the SRE mechanism that lets teams decide how much downtime is acceptable. They explore Google's original framework, how it resolves the tension between feature velocity and system

How SRE Teams Use Incident Metrics to Improve Postmortem Quality

Jul 14, 20269mEp. 110S3

Site reliability engineers write postmortems after every significant incident, but not all postmortems are created equal. In this episode, Lucas and Luna explore how leading SRE teams apply quantitative metrics — like ti

How SRE Teams Use Chaos Engineering to Find Hidden Failure Modes

Jul 13, 20269mEp. 109S3

In this episode of The Site Reliability Podcast, Lucas and Luna dive into chaos engineering as a proactive practice for uncovering hidden failure modes in production systems. They discuss how Netflix pioneered the approa

How SRE Teams Use Latency SLOs to Improve User Experience

Jul 13, 20266mEp. 108S3

Lucas and Luna explore how site reliability engineering teams set and enforce latency Service Level Objectives to prevent slow page loads from driving users away. Using examples from Google Search and e-commerce, they ex

How SRE Teams Use Incident Postmortems for Systemic Improvement

Jul 12, 202610mEp. 107S3

In this episode of The Site Reliability Podcast, Lucas and Luna explore how incident postmortems go beyond blame to drive systemic reliability improvements. They examine the anatomy of a good postmortem, including the 'f

Advertisers on this show

Detected from recent episode content. Sponsor presence is a real signal of listener purchasing power and show monetisation.

Fexingo2 eps· last 8 days ago

buymeacoffee.com/fexingo· last 8 days ago

FexingoBusiness· last 8 days ago

buymeacoffee· last 8 days ago

Audience demographics

Age

25-54

Consumer type

Professionals & Founders

Topics covered

business

Successful pitch examples

No public pitch examples yet for this show.

Generate your own personalised pitch

Best industries to pitch The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering for

Based on semantic analysis of episode topics and host coverage, this show is a strong guest fit for executives in:

Gaming & Esportsmoderate Data Analytics & BImoderate Professional Developmentmoderate B2B SaaS / Email Marketing & Growthmoderate Operations & Process Managementmoderate

Industry fit is computed by PitchCentric using vector embeddings of the show's episode catalog.

Similar podcasts to The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering

Shows with the most semantically similar episode content. Pitch one, pitch all; producers cluster.

The Software Engineering Podcast with Fexingo: Code, Architecture, and Engineering Best Practices

Fexingo

29~~415 listeners83% similar

Incidentally Reliable Podcast

Zenduty

18~~255 listeners81% similar

Reliability Enablers

Ash Patel & Sebastian Vietz

26~~371 listeners81% similar

DevOps Daily with Fexingo: CI/CD, Kubernetes, and Modern Software Operations

Fexingo

29~~415 listeners80% similar

Reliability Rebels

Amin Astaneh

16~~229 listeners79% similar

Google SRE Prodcast

Salim Virji

25~~357 listeners78% similar

The Web Development Podcast with Fexingo: Frontend, Backend, and Modern Web Stack

Fexingo

29~~415 listeners77% similar

The Developer Tools Podcast with Fexingo: APIs, Infrastructure, and Software for Engineers

Fexingo

28~~404 listeners77% similar

Frequently asked questions

How do I pitch The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering as a podcast guest?

The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering has a verified contact on file. Create a free PitchCentric account to access it and generate a personalised pitch in seconds. Research at least 3 recent episodes first and lead with a specific angle that serves their business audience.

Who is the host of The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering?

The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering is hosted by Fexingo. The show is categorised under business and has published 117 episodes.

How many episodes does The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering have?

The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering has published 117 episodes.

What topics does The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering cover?

The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering regularly covers business. It sits in the business category.

Is it hard to get booked on The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering?

The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering is accessible for guests with genuine business expertise. A personalised, episode-aware pitch will still outperform a generic one every time.

Is The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering currently accepting guest pitches?

The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering hasn't explicitly signalled guest openness in recent episodes. That doesn't rule out pitching. your hook just needs to be especially compelling and relevant to their recent content.

How long are The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering episodes?

Episodes of The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering average 10 minutes. a focused format where a clear narrative arc and tight preparation matter most.

What guest credentials does The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering typically look for?

Our data rates The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering's guest bar at 80/100 (Premium tier). Established thought leaders with verified media credentials. Sign in to PitchCentric to see how your own Pod Score compares against this show.

Methodology. Booking Probability™ blends Listen Score, 30-day Virality, open-to-guests detection, and Apple ratings. Data refreshed every 60 minutes. Listen Score and Booking Probability are calculated by PitchCentric. Last enriched 6 days ago.

Podcasts like The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering

All podcasts More business Top in US Top 100 by Booking Probability Booking Probability Index How scoring works How to get on podcasts

Is this podcast yours and you'd like to remove or correct details? Request removal or email privacy@pitchcentric.com.

Pitch Analysis

Sign in to see how your Guest Score compares to this show's Required Pod Score and get a Stretch / Match-fit / Anchor verdict.

Required Pod Score

80/ 100

Premium

Established thought leaders with verified media credentials.

Contact path

Verified email on file

Unlock verified contacts

Guest openness

Not signalled recently

Best topics to pitch

business

What is The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering?

business

Frequently asked questions

How do I pitch The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering as a podcast guest?

Who is the host of The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering?

The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering is hosted by Fexingo. The show is categorised under business and has published 117 episodes.

How many episodes does The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering have?

The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering has published 117 episodes.

What topics does The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering cover?

The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering regularly covers business. It sits in the business category.

Is it hard to get booked on The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering?

Is The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering currently accepting guest pitches?

How long are The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering episodes?

Episodes of The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering average 10 minutes. a focused format where a clear narrative arc and tight preparation matter most.