Getting the Most Out of ScyllaDB Monitoring: ShareChat's Tips
ScyllaDB
556 views
32 slides
Jun 19, 2024
Slide 1 of 32
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
About This Presentation
ScyllaDB monitoring provides a lot of useful information. But sometimes it’s not easy to find the root of the problem if something is wrong or even estimate the remaining capacity by the load on the cluster. This talk shares our team's practical tips on: 1) How to find the root of the problem ...
ScyllaDB monitoring provides a lot of useful information. But sometimes it’s not easy to find the root of the problem if something is wrong or even estimate the remaining capacity by the load on the cluster. This talk shares our team's practical tips on: 1) How to find the root of the problem by metrics if ScyllaDB is slow 2) How to interpret the load and plan capacity for the future 3) Compaction strategies and how to choose the right one 4) Important metrics which aren’t available in the default monitoring setup.
Size: 13.94 MB
Language: en
Added: Jun 19, 2024
Slides: 32 pages
Slide Content
Getting The Most Out of ScyllaDB Monitoring: Rarely Mentioned Debugging and Tuning Tips Andrei Manakov, Staff Software Engineer at ShareChat
Andrei Manakov More than 13 years experience in industry Designed and developed multiple highload projects Passionate about performance problems in distributed systems Developing TikTok-like app with more than 20m DAU
Why Load is Not Really U seful ? fd https://github.com/scylladb/scylla-monitoring/issues/2003
CPU Capacity Analysis fd max(sum(rate(scylla_scheduler_runtime_ms{group!="compaction"}[2m])) by (instance, shard))/10 fd avg (sum(rate(scylla_scheduler_runtime_ms{group!="compaction"}[2m])) by (instance, shard))/10