Skill Wiki v0.1.0
fact @community/fact-percentile-vs-mean

Percentile Vs Mean

Real-world request latencies are not normally distributed — they are heavy-tailed mixtures of fast cache hits, slower DB reads, occasional cold starts, and rare timeout-driven retries.…

Skill
@community
Domain
ops-observability
Version
1.0.0
Quality
4.0
Edges
5 out · 4 in
Tokens
251/813/1558
$ prime install @community/fact-percentile-vs-mean

Projection

Always in _index.xml · the agent never has to ask for this.

PercentileVsMean [fact] v1.0.0

Latency distributions in production systems are heavily right-skewed and multi-modal; the arithmetic mean hides tail behavior that high percentiles (p95, p99, p99.9) reveal. SLO targets must be defined on percentiles, never on the mean.

Real-world request latencies are not normally distributed — they are heavy-tailed mixtures of fast cache hits, slower DB reads, occasional cold starts, and rare timeout-driven retries. The arithmetic mean is dominated by the bulk of fast requests and barely moves when the slow tail doubles. The p99 (or p99.9) directly measures the tail. A service with mean latency of 50ms and p99 of 2000ms has an awful experience for 1% of users (potentially the most-engaged ones, who make many requests). A service with mean latency of 80ms and p99 of 200ms is dramatically better, despite the mean being higher. Gil Tene's 'How NOT to Measure Latency' (2015 Strange Loop talk) is the canonical industry reference.

Source

prime-system/examples/frontend-design/primes/compiled/@community/fact-percentile-vs-mean/atom.yaml

Compiled at 2026-05-07