PitchCentric
The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering
Updated 10 days ago · Refreshed hourly
General

The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering

Hosted by Unknown Host · EN · 5 episodes

Where this show ranks

Audience Score
0
Niche
Episodes
5
Last ep.
15 days ago
Avg length
10m
Booking Probability™
34
Stretch.
Sign in to score against your profile.
Estimated audience
,
Audience size not yet estimated
Listen Score
11
Niche reach.
Virality (30d)
42
Steady cadence.

Pitch Analysis

Sign in to see how your Guest Score compares to this show's Required Pod Score and get a Stretch / Match-fit / Anchor verdict.
Required Pod Score
80/ 100
Premium

Established thought leaders with verified media credentials.

Contact path
Verified email on file
Unlock verified contacts
Guest openness
Not signalled recently

About this podcast

Lucas and Luna cut through the noise around site reliability engineering to examine how real-world SRE teams balance uptime, incident response, and production change. Each episode takes a single concept — error budgets, toil automation, postmortem culture, capacity planning — and grounds it in a specific case: how a major streaming service reduced paging noise, how a payments platform rebuilt its incident command structure, or how a cloud provider manages multi-region failover. Lucas brings the numbers — latency percentiles, MTTR trends, SLO burn rates — while Luna pushes on the human and organizational trade-offs: What does a junior SRE need to know about on-call? How do you measure reliability without crushing innovation? Why do some blameless postmortems actually work? Together they treat SRE not as a certification topic but as a living practice, citing real outages, open-source tools, and engineering blogs. This show is for engineers, ops leads, and platform teams who already know the basics and want to debate the hard edges: Is 99.999% uptime always worth the cost? When should you deliberately degrade service to improve reliability? How do you design for resilience when your system is already in production? Lucas and Luna don't pretend to have final answers — they build the conversation so you can draw your own. If you've ever argued about whether a page was necessary or whether an SLO should be tightened, this is your show.

About the host

Unknown Host hosts The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering, a general show with 5 episodes published.

Verified host email:████████@████.comSign in to unlock →

Recent episodes

Our AI reads these to draft pitches

How SRE Teams Use Runbook Automation to Reduce Human Error

Jun 5, 20268mEp. 33S1

In this episode of The Site Reliability Podcast, Lucas and Luna dive into the practical side of runbook automation — moving beyond static documentation to executable, automated responses. They explore how companies like

How SRE Teams Use Cost Optimization to Balance Performance and Budget

Jun 5, 20266mEp. 32S1

In this episode of The Site Reliability Podcast with Fexingo, Lucas and Luna dive into the often-overlooked intersection of site reliability engineering and cloud cost optimization. They explore how SRE teams at companie

How SRE Teams Use Load Shedding to Survive Traffic Spikes

Jun 4, 20269mEp. 31S1

When a massive traffic spike hits, every millisecond of latency can cost thousands of dollars. In this episode, Lucas and Luna explore load shedding — the SRE technique of intentionally dropping non-critical requests to

How SRE Teams Use Feature Flags to Reduce Incident Risk

Jun 4, 202611mEp. 30S1

Feature flags are a powerful tool for SREs, but they come with their own operational risks. In this episode, Lucas and Luna explore how companies like Etsy, Netflix, and LaunchDarkly use feature flags to decouple deploym

How SRE Teams Use Incident Metrics to Reduce Mean Time to Resolve

Jun 3, 20266mEp. 29S1

In episode 29 of The Site Reliability Podcast, Lucas and Luna dive into the specific metrics SRE teams use to reduce mean time to resolve (MTTR) during incidents. They break down the difference between mean time to ackno

How Cloud SREs Use Circuit Breakers to Prevent Cascading Failures

Jun 3, 202614mEp. 28S1

When a single service fails, the whole system shouldn't collapse. In this episode, Lucas and Luna dive into the circuit breaker pattern — a critical resilience tool in site reliability engineering. They break down how Ne

How SREs Use Error Budgets to Balance Reliability and Velocity

Jun 2, 20268mEp. 27S1

In this episode of The Site Reliability Podcast, Lucas and Luna dive into the practical mechanics of error budgets — the SRE tool that lets teams trade reliability for feature velocity without breaking trust. They walk t

How SRE Teams Use Game Days to Build Muscle Memory for Incidents

Jun 2, 20268mEp. 26S1

In Episode 26 of The Site Reliability Podcast, Lucas and Luna explore how SRE teams run 'game days' — simulated incident exercises — to build muscle memory and reduce panic during real outages. They break down how Etsy,

How SRE Teams Use Error Budgets to Balance Reliability and Velocity

Jun 1, 20268mEp. 25S1

In this episode of The Site Reliability Podcast, Lucas and Luna explore how SRE teams use error budgets to make smart trade-offs between reliability and feature velocity. They break down the concept with concrete example

SRE Runbooks That Actually Get Followed

Jun 1, 202611mEp. 24S1

Most SRE teams have runbooks. Few have runbooks that engineers actually use in the middle of an incident. Lucas and Luna dive into why the typical runbook fails — too long, too vague, or written for the person who alread

How SRE Teams Use Observability to Reduce Mean Time to Acknowledge

May 31, 20268mEp. 23S1

Mean time to acknowledge (MTTA) is the clock that starts when an alert fires and stops when an engineer clicks 'ack'. For most teams, that gap is the single biggest waste of incident response time. In this episode, Lucas

How SRE Teams Use Synthetic Monitoring to Catch Outages First

May 31, 202611mEp. 22S1

Episode 22 of The Site Reliability Podcast explores synthetic monitoring — proactive testing that catches outages before real users feel them. Lucas and Luna break down how companies like Etsy and Twilio simulate user jo

How SRE Teams Use Traffic Shadowing for Safe Testing

May 30, 202611mEp. 21S1

In this episode of The Site Reliability Podcast, Lucas and Luna explore traffic shadowing: a technique that lets SRE teams test new services with live production traffic without affecting real users. They break down how

How SRE Teams Use Canary Deployments to Reduce Blast Radius

May 30, 202610mEp. 20S1

In this episode of The Site Reliability Podcast, Lucas and Luna dive into the practice of canary deployments—a key strategy for reducing blast radius in production. They break down how teams like Etsy and Netflix use pha

How SRE Teams Use Data to Predict Incidents Before They Happen

May 29, 20267mEp. 19S1

Most incident response is reactive—you get paged, you triage, you fix. But a growing number of SRE teams are flipping the model: using historical data, machine learning, and anomaly detection to predict incidents before

How SRE Teams Use Capacity Planning to Prevent Black Friday Outages

May 29, 20268mEp. 18S1

In this episode, Lucas and Luna explore how site reliability engineering teams use capacity planning to avoid catastrophic outages during peak traffic events like Black Friday and Cyber Monday. They break down the specif

How SRE Teams Use Service Level Objectives to Drive Business Decisions

May 28, 202610mEp. 17S1

Lucas and Luna explore how service level objectives (SLOs) have evolved from a technical metric into a strategic business tool. Using examples from Google, Etsy, and a mid-size fintech startup, they show how SLOs help SR

How SRE Teams Use Toil Budgets to Prioritise Automation

May 28, 20266mEp. 16S1

Episode 16 of The Site Reliability Podcast explores toil budgets: the SRE practice of capping manual, repetitive work so teams have time for automation. Lucas and Luna break down how Google defined toil in its SRE book,

How SRE Teams Handle On-Call Burnout Without Burning Out

May 27, 202613mEp. 15S1

Episode 15 of The Site Reliability Podcast with Fexingo dives into the human side of site reliability engineering: on-call burnout. Lucas and Luna explore how teams at companies like Etsy and Honeycomb use structured rot

How SRE Teams Use Chaos Engineering for Non-Netflix Systems

May 27, 20268mEp. 14S1

Lucas and Luna explore how site reliability engineers adapt chaos engineering beyond Netflix's famous Simian Army. The episode focuses on a mid-size e-commerce company, BlinkMart, which used controlled failure injection

Sponsors and advertisers

Sponsor detection runs nightly. Check back soon.

Audience demographics

Age
25-54
Consumer type
General audience

Successful pitch examples

No public pitch examples yet for this show.

Generate your own personalised pitch

If you're pitching The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering, also consider

Shows with the most semantically similar episode content. Pitch one, pitch all; producers cluster.

Frequently asked questions

How do I pitch The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering as a podcast guest?

The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering has a verified contact on file. Create a free PitchCentric account to access it and generate a personalised pitch in seconds. Research at least 3 recent episodes first and lead with a specific angle that serves their general audience.

Who is the host of The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering?

The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering is hosted by Unknown Host. The show is categorised under General and has published 5 episodes.

How many episodes does The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering have?

The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering has published 5 episodes.

Is it hard to get booked on The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering?

The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering is accessible for guests with genuine general expertise. A personalised, episode-aware pitch will still outperform a generic one every time.

Is The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering currently accepting guest pitches?

The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering hasn't explicitly signalled guest openness in recent episodes. That doesn't rule out pitching. your hook just needs to be especially compelling and relevant to their recent content.

How long are The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering episodes?

Episodes of The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering average 10 minutes. a focused format where a clear narrative arc and tight preparation matter most.

What guest credentials does The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering typically look for?

Our data rates The Site Reliability Podcast with Fexingo: SRE, Uptime, and Production Engineering's guest bar at 80/100 (Premium tier). Established thought leaders with verified media credentials. Sign in to PitchCentric to see how your own Pod Score compares against this show.

Methodology. Booking Probability™ blends Listen Score, 30-day Virality, open-to-guests detection, and Apple ratings. Data refreshed every 60 minutes. Listen Score and Booking Probability are calculated by PitchCentric. Last enriched 10 days ago.

Is this podcast yours and you'd like to remove or correct details? Request removal or email privacy@pitchcentric.com.