Skip to content

Quality Metrics & Health Signals

What pass rate, flaky rate, and other metrics mean, and how to interpret them for release decisions.

Certyn tracks several metrics that help you understand the health of your test suite and the risk profile of your environments.

Key metrics

MetricWhat it measuresTargetRed flag
Pass rate% of test executions that passed> 95%< 90%
Flaky rate% of tests with inconsistent results< 5%> 10%
Open bug countNumber of unresolved issuesTrending downTrending up
Needs Review countItems awaiting human decisionNear zeroGrowing backlog
Mean time to detectHow quickly failures are caught< 1 hour> 4 hours

Pass rate alone does not tell the full story

A 95% pass rate with 3 critical-severity failures is riskier than 90% with only cosmetic issues. Always check severity alongside pass rate.

Understanding flakiness

A test is flaky when it produces different results across runs without any product changes. Certyn measures flakiness using a flip-rate algorithm:

  • A flip is when a test switches between pass and fail on consecutive runs
  • Flakiness score = number of flips / (total runs - 1), expressed as a percentage
  • Tests with a flakiness score above 20% (and at least 5 runs) are candidates for quarantine

Flaky tests erode confidence in your results. A flaky failure looks the same as a real failure, making it harder to spot actual bugs.

Risk signals to watch

These patterns indicate growing risk:

SignalWhat it meansRecommended action
Declining pass rate (week over week)New failures are appearingInvestigate recent failures, check for regressions
Growing open-bug countIssues are being created faster than resolvedPrioritize triage, consider blocking releases
Rising flaky rateTest suite reliability is degradingQuarantine worst offenders, investigate root causes
Stale Needs Review itemsHuman decisions are backloggedClear the review queue to prevent drift
Blocked runsTests cannot executeCheck environment health, data dependencies
PatternInterpretation
Stable (< 5% variance week over week)Suite is healthy, changes are not introducing regressions
Declining (> 5% drop week over week)Regressions or environment instability — investigate
Volatile (frequent swings)Likely flakiness — check flaky rate and quarantine unstable tests
Improving (> 5% rise week over week)Fixes are landing and being verified

Environment health

Each environment gets a daily health summary based on that day's executions. A healthy environment has:

  • recent executions completing successfully
  • pass rate above your project threshold
  • no new critical bugs
  • no stale blocked runs