Question 1

What sampling rate should I use for distributed tracing?

Accepted Answer

Depends on traffic: under 100 RPS keep 100% (no sampling). 100-1,000 RPS use 10-25% head-based, or 100% with tail-based to preserve error traces. 1,000-10,000 RPS use 1-5% head-based. Above 10,000 RPS go to 0.1-1% head-based and rely on tail-based sampling to keep interesting traces.

Question 2

Head-based vs tail-based sampling?

Accepted Answer

Head-based decides at trace start whether to record — simple but may drop slow or errored traces you would want to see. Tail-based buffers all spans and decides at trace end — lets you keep 100% of errors and slow requests while dropping 99% of fast successful ones. Tail requires more Collector memory but produces better data.

Question 3

How expensive is distributed tracing?

Accepted Answer

Datadog APM: ~$1.27 per million spans. Honeycomb: ~$0.50/M. New Relic: ~$0.30/M. Self-hosted Tempo on S3: under $0.10/M. At 1,000 RPS with 10 spans/request and 10% sampling, you produce ~260M spans/month — that's $330 on Datadog or $26 on Tempo.

Question 4

Should I sample at the application or the Collector?

Accepted Answer

At the OpenTelemetry Collector — that's where tail-based sampling lives. Application-level (SDK) sampling is head-based only. Configure the Collector with the tail_sampling processor to keep all error traces, all traces over a latency threshold, and a fixed percent of normal traces.

Question 5

Can I run my own tracing backend cheaply?

Accepted Answer

Yes. Grafana Tempo on S3 is the cheapest self-hosted option — index is tiny, content compressed in S3. Jaeger with Cassandra/ES is more operational overhead. For most teams under 100M spans/month, managed (Honeycomb, Grafana Cloud Traces) is cheaper once you count engineering time.

Traffic	Sampling
< 100 RPS	100% — don't sample
100-1,000 RPS	10-25% head-based, OR 100% + tail-based
1,000-10,000 RPS	1-5% head-based, OR aggressive tail
> 10,000 RPS	0.1-1% head-based + tail for errors

Trace Sampling Calculator: Spans, Cost by Vendor (2026)

Trace Sampling: The Cost Lever That Matters

Sampling Strategies

Sampling Rate by Traffic

Related Tools

Cardinality Estimator

Log Volume Cost Calculator

Latency Percentile Calculator

Trace bills outrunning your value?