Tag
#oncall
12 recipes tagged #oncall.
- ops 1 hour to set up · advanced
Nightly Routine: draft PRs for low-risk silent bugs
Routine runs at 2am, picks the top silent bug, investigates it, opens a draft PR if the fix is small and contained. Lights-out triage for the next morning's review.
#errors#routine#automation#auto-fix - debugging 30 seconds · beginner
Find what broke after your last deploy
Diff error fingerprints before and after the latest deploy SHA. Surface regressions in 30 seconds, including a candidate fix.
#errors#deploys#post-deploy#regression - debugging 2 minutes · intermediate
Mobile background-sync failure monitoring
Background sync, refresh, prefetch jobs fail silently and degrade UX. Group failures by task_name and network_type to find the patterns.
#errors#mobile-app#background-jobs#infra - debugging 2 minutes · advanced
Find connection-pool / timeout patterns in errors
Surface DB connection exhaustion, statement timeouts, and deadlock retries. Find the original slow query starving everything else.
#errors#database#infra#scaling - debugging 30 seconds · beginner
Find what broke after the last npm update
Tie new error fingerprints to the deploy that bumped dependencies. Spot the silent regression nobody intended to ship.
#errors#regression#dependencies#deploys - debugging 1 minute · beginner
Compare error rates between staging and prod
Diff fingerprints between staging and production. Find what's already breaking in staging that the next deploy will carry into prod.
#errors#environments#pre-deploy#staging - debugging 1 minute · intermediate
Diagnose a flaky CI test from its error payloads
Group test failures across CI runs by error pattern. Get a verdict — real bug or timing — without the manual archaeology.
#errors#ci#testing#flaky-tests - debugging 1 minute · intermediate
Catch display-vs-actual-stock mismatches
Find inventory sync errors trending up by warehouse and SKU before customers do. The 'in stock' display lie is one of the highest-cost bugs in ecommerce.
#errors#inventory#ecommerce#infra - debugging 1 minute · intermediate
Detect a 429-spike before customers tweet about it
Trend HTTP 429 responses week-over-week and flag spikes. Catch third-party rate-limit tightening or a runaway customer integration.
#errors#rate-limiting#early-warning#datadog-alternative - debugging 10 seconds · beginner
Find the bugs 5 people hit but no one reported
Surface real errors that affect multiple users but never get filed as tickets. The biggest source of silent churn.
#errors#retention#saas#sentry-alternative - ops 5 minutes · intermediate
Watch staging — page only on genuinely new errors
Set up a scheduled Routine that diffs error fingerprints each hour and only pages oncall when a genuinely new failure mode appears.
#errors#post-deploy#webhooks#automation - ops 1 minute · beginner
Stop noisy errors from paging you — in plain English
Suppress third-party SDK timeouts, ad-blocker errors, and other false-positives by describing them. No regex required.
#errors#suppressions#sentry-alternative#saas