Measuring Query Latency the Hard Way: An Adventure in Impractical Postgres Monitoring by Simon Notley

ScyllaDB 0 views 20 slides Oct 15, 2025

About This Presentation

Sampling the session state (as exposed by pg_stat_activity) is a surprisingly powerful way to understand how your Postgres instance spends its time. It is something I can wholeheartedly recommend to any Postgres DBA who needs a lightweight way to monitor query performance in production. However, it...

Size: 2.18 MB

Language: en

Added: Oct 15, 2025

Slides: 20 pages

Slide Content

A ScyllaDB Community
Measuring Query Latency the Hard Way: An Adventure in Impractical Postgres Monitoring
Simon Notley
Observability and Optimization

Simon Notley (he/him)

Observability and Optimization PM at EDB
■ Something cool: Gained the freedom of Tryfan
■ Perspective on P99s: The time 1 in every 100 of your users is waiting and wondering if it’s broken
■ Another thing: I used to race bicycles
■ Away from work: dad stuff

Pursuing terrible ideas can be fun

Take a good idea: Session sampling
Understand its strengths and weaknesses: Good at proportions, bad at details
Undeterred, try to apply it to something you know it’s no good at: Query latency
Have fun: “fun”

Time-domain sampling, huh! What is it good for?

[Chart: two proportions, 23% and 20%]

Time-domain sampling, huh! What is it good for?

Details
Proportions
…and I will now use it for details…
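To make the "proportions" point concrete, here is a minimal sketch of turning session samples into time shares. It assumes each snapshot is a list of (state, wait_event_type) pairs read from pg_stat_activity at one instant. The input shape, the function name `time_share`, and the convention of counting an active session with no wait event as "CPU" are illustrative assumptions, not something the slides specify.

```python
from collections import Counter


def time_share(snapshots):
    """Estimate the share of active session time per wait category.

    `snapshots` is a list of samples; each sample is a list of
    (state, wait_event_type) tuples, one per session (hypothetical shape).
    """
    counts = Counter()
    for snapshot in snapshots:
        for state, wait_event_type in snapshot:
            if state != "active":
                continue  # idle sessions are not doing query work
            # Assumption: an active session with no wait event is on-CPU.
            counts[wait_event_type or "CPU"] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {category: n / total for category, n in counts.items()}


# Three made-up snapshots taken at regular intervals.
snapshots = [
    [("active", None), ("active", "IO"), ("idle", "Client")],
    [("active", "IO"), ("active", "Lock")],
    [("active", None), ("idle", "Client")],
]
shares = time_share(snapshots)
for category, share in sorted(shares.items()):
    print(f"{category}: {share:.0%}")
```

Each sample is only a count of who was doing what at one instant, so the ratios converge on time proportions as samples accumulate, but nothing in a single snapshot says how long any one query ran.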

Even an end has a start

[Timeline diagram along a time axis, with intervals labeled a and b. Annotations: "We know the query started here", "It was still running here", "It wasn’t running here".]

Even an end has a start
[Timeline diagram along a time axis, with intervals labeled a, b and c.]
estimated_duration = 2a + c

Unbiased!

For a 1000 ms query, for a range of sample periods, calculate our estimated duration for all possible relative positions of the query and the samples.

Box: 25th - 75th percentile
Whiskers: 5th - 95th percentile
Central mark: median (and mean)
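The experiment described on this slide can be sketched in a few lines. This version assumes a midpoint estimator: the query start is known, and the end is placed halfway between the last sample that saw the query running and the next sample. That is one plausible reading of the timeline slides, not necessarily the exact formula the talk uses, and the names `midpoint_estimate` and `summarize` are hypothetical.

```python
import statistics


def midpoint_estimate(true_ms, period_ms, first_sample_ms):
    """Estimate the duration of a query that starts at t = 0.

    Samples fall at first_sample_ms, first_sample_ms + period_ms, ...
    Returns None if the query ends before the first sample sees it.
    """
    if first_sample_ms >= true_ms:
        return None
    # Whole sample periods that fit between the first sample and the end.
    periods_seen = (true_ms - first_sample_ms) // period_ms
    last_seen = first_sample_ms + periods_seen * period_ms
    # Assumption: the query ended halfway to the next sample.
    return last_seen + period_ms / 2


def summarize(true_ms, period_ms):
    """Mean and median estimate over all relative sample positions."""
    estimates = []
    for i in range(period_ms):
        # Half-millisecond offsets avoid ties between a sample and the end.
        estimate = midpoint_estimate(true_ms, period_ms, i + 0.5)
        if estimate is not None:
            estimates.append(estimate)
    return statistics.mean(estimates), statistics.median(estimates)


results = {period: summarize(1000, period) for period in (100, 250, 500, 1000)}
for period, (mean, median) in results.items():
    print(f"period={period} ms mean={mean} median={median}")
```

Under these assumptions the mean and median both land on 1000 ms for every period up to the query length, while the spread of individual estimates widens as the period grows.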

Biased?

Query ID   True Mean Latency / ms
1          56 ± 3
2          300 ± 10
3          550 ± 20
4          800 ± 20
5          1050 ± 20

Biased?

Query ID   True Mean Latency / ms   Estimated Mean Latency / ms
1          56 ± 3                   520 ± 70
2          300 ± 10                 520 ± 40
3          550 ± 20                 680 ± 40
4          800 ± 20                 870 ± 40
5          1050 ± 20                1080 ± 40

Biased?

Query ID   True Mean Latency / ms   Estimated Mean Latency / ms   True Mean Latency (sampled queries only) / ms
1          56 ± 3                   520 ± 70                      520 ± 60
2          300 ± 10                 520 ± 40                      520 ± 30
3          550 ± 20                 680 ± 40                      680 ± 30
4          800 ± 20                 870 ± 40                      870 ± 20
5          1050 ± 20                1080 ± 40                     1080 ± 20
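The estimates in this table track the true mean of the sampled queries rather than of all queries, which points at selection: a short query can start and finish between two samples and never be seen. A minimal sketch of that effect, assuming a query of duration d is caught with probability min(1, d / period) and using a made-up workload (the function name and the numbers are illustrative only):

```python
def sampled_mean(durations_ms, period_ms):
    """Expected mean duration of the queries a periodic sampler sees.

    Assumption: a query of duration d is caught by at least one sample
    with probability min(1, d / period_ms).
    """
    catch_prob = [min(1.0, d / period_ms) for d in durations_ms]
    weighted_sum = sum(d * p for d, p in zip(durations_ms, catch_prob))
    return weighted_sum / sum(catch_prob)


# Hypothetical workload: 90 fast executions and 10 slow ones.
durations = [10] * 90 + [1000] * 10
true_mean = sum(durations) / len(durations)
seen_mean = sampled_mean(durations, period_ms=1000)
print(f"true mean: {true_mean:.0f} ms, mean of sampled queries: {seen_mean:.0f} ms")
```

With this workload the sampler mostly sees the slow executions, so the mean over sampled queries sits far above the true mean even if every individual estimate is accurate.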

Stop! Weight a minute!

Query ID   True Mean Latency / ms   Estimated Mean Latency / ms
1          56 ± 3                   19 ± 4
2          300 ± 10                 80 ± 20
3          550 ± 20                 150 ± 50
4          800 ± 20                 270 ± 100
5          1050 ± 20                500 ± 200
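The slide title hints at weighting, but the slides do not say which weights were used. As a point of comparison, here is a sketch of inverse-probability weighting in the idealized case where the true duration of every sampled query is known. It assumes the catch probability is min(1, d / period); the function name and workload are hypothetical.

```python
def ipw_mean(observed_ms, period_ms):
    """Inverse-probability-weighted mean of the observed durations.

    Assumption: a query of duration d was caught with probability
    min(1, d / period_ms), so it stands in for 1 / probability queries.
    """
    weights = [1.0 / min(1.0, d / period_ms) for d in observed_ms]
    weighted_sum = sum(w * d for w, d in zip(weights, observed_ms))
    return weighted_sum / sum(weights)


# Hypothetical workload of 900 fast (10 ms) and 100 slow (1000 ms) queries,
# true mean 109 ms. With a 1000 ms period the sampler would be expected to
# catch about 9 of the fast queries and all 100 of the slow ones.
observed = [10] * 9 + [1000] * 100
naive = sum(observed) / len(observed)
corrected = ipw_mean(observed, period_ms=1000)
print(f"naive mean: {naive:.0f} ms, weighted mean: {corrected:.0f} ms")
```

In practice the sampler only has estimated durations to build the weights from, so a correction like this can overshoot or undershoot; the table above shows the weighted estimates in the talk falling below the true means.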

Funky charts!

[Chart. Legend: true, true_obs, elapsed, estimate. Horizontal axis ticks: 1 to 5.]

Not so funky chart

Funkier…

Now we’re talking!

Nobody expects vector search!

Query ID   True Mean Latency / ms   Vector Search Mean Latency / ms   Percentage error at run-level
1          56 ± 3                   60 ± 20                           10 ± 30
2          300 ± 10                 300 ± 40                          0 ± 10
3          550 ± 20                 550 ± 50                          0 ± 9
4          800 ± 20                 800 ± 60                          0 ± 7
5          1050 ± 20                1060 ± 60                         1 ± 5
