Spotify: Automating Cassandra repairs

planetcassandra | Oct 02, 2015

About This Presentation

Anti-entropy repairs are known to be a peculiar maintenance operation of Cassandra clusters. They are problematic mostly because of their potential negative impact on the cluster's performance. Another problematic aspect is the difficulty of managing the repairs of a Cassandra cluster...


Slide Content

Automating Cassandra Repairs

Radovan Zvoncek
[email protected]

github.com/spotify/cassandra-reaper
#CassandraSummit

About zvo
Likes pancakes

Does this for the 3rd time

Works at Spotify

Working at Spotify
Is autonomous

Squads are responsible for their full stack

Including Cassandra

Cassandra
[ring diagram: a node's data]
[ring diagram: replication of that data to other nodes]
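
To make the replication picture concrete, here is a minimal sketch of how a token ring maps a partition to its replicas. The three-node ring, the tokens, and RF = 3 are made-up example values, not anything from the talk:

    import java.util.ArrayList;
    import java.util.List;
    import java.util.TreeMap;

    // Sketch: each node owns the token range ending at its token, and with
    // replication factor 3 the next two nodes on the ring hold copies too.
    // Node names and tokens are invented for the example.
    public class RingSketch {
        public static void main(String[] args) {
            TreeMap<Long, String> ring = new TreeMap<>();
            ring.put(-4611686018427387904L, "node1");
            ring.put(0L, "node2");
            ring.put(4611686018427387904L, "node3");

            long keyToken = 42L;  // token of some partition key
            Long owner = ring.ceilingKey(keyToken);
            if (owner == null) owner = ring.firstKey();  // wrap around the ring

            List<String> replicas = new ArrayList<>();
            Long t = owner;
            for (int i = 0; i < 3; i++) {  // walk clockwise collecting RF = 3 replicas
                replicas.add(ring.get(t));
                t = ring.higherKey(t);
                if (t == null) t = ring.firstKey();
            }
            System.out.println("token " + keyToken + " -> " + replicas);
        }
    }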

Running Cassandra
Requires many things

One of them is keeping data consistent
Eventually

Otherwise it can get lost or reappear

Eventual consistency
Three mechanisms bring replicas back in sync:

Read Repairs
[ring diagram: a read (R) and write (W) across replicas]

Hinted Handoff
[ring diagram]

Anti-entropy Repair
[ring diagram]

Anti-entropy Repair
Coordinated process
Four steps:
1: Hash
2: Compare
3: Stream
4: Merge
Can go wild...
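
As a rough illustration of the hash-and-compare steps: each replica digests its data for a token range, and the coordinator diffs the digests, streaming only on a mismatch. This is a simplified sketch; real Cassandra builds Merkle trees over the range rather than a single digest, and the row data here is invented:

    import java.nio.charset.StandardCharsets;
    import java.security.MessageDigest;
    import java.util.*;

    // Simplified sketch of repair's hash-and-compare phase: hash the rows a
    // replica holds for a range, then compare digests across two replicas.
    public class RepairSketch {
        static byte[] digest(SortedMap<String, String> rows) throws Exception {
            MessageDigest md = MessageDigest.getInstance("MD5");
            for (Map.Entry<String, String> e : rows.entrySet()) {
                md.update(e.getKey().getBytes(StandardCharsets.UTF_8));
                md.update(e.getValue().getBytes(StandardCharsets.UTF_8));
            }
            return md.digest();
        }

        public static void main(String[] args) throws Exception {
            SortedMap<String, String> replicaA = new TreeMap<>(Map.of("k1", "v1", "k2", "v2"));
            SortedMap<String, String> replicaB = new TreeMap<>(Map.of("k1", "v1", "k2", "stale"));

            // Step 1: each replica hashes its data for the range.
            // Step 2: the coordinator compares the digests.
            boolean inSync = MessageDigest.isEqual(digest(replicaA), digest(replicaB));

            // Steps 3 and 4 (stream and merge) only happen on a mismatch.
            System.out.println(inSync ? "range in sync" : "mismatch: stream + merge needed");
        }
    }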

Repair gone wild
Eats a lot of disk IO
● because of hashing all the data
Saturates the network
● because of streaming a lot of data around
Fills up the disk
● because of receiving all replicas, possibly from all other data centers
Causes a ton of compactions
● because of having to merge the received data

… one better be careful

Careful repair

nodetool repair
● repairs all three intervals (every range the node replicates)

Partitioner range
● nodetool repair -pr
● repairs this interval only (the node's primary range)

Start & end tokens
● nodetool repair -pr -st <start_token> -et <end_token>
● repairs a part of the interval only


Careful repair
Repairing a part of an interval requires splitting the ring into smaller intervals

Smaller intervals mean less data

Less data means fewer repairs gone wild
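
A minimal sketch of the splitting idea, assuming the Murmur3 partitioner's token range of [-2^63, 2^63 - 1]: divide the full ring into equally sized segments, each repairable with -st/-et. Reaper's real segment generation is more involved (it accounts for the cluster's actual node tokens); this only shows the core idea:

    import java.math.BigInteger;
    import java.util.ArrayList;
    import java.util.List;

    // Sketch: split the full Murmur3 token ring into equally sized
    // repair segments, each small enough to repair safely.
    public class SegmentSplitter {
        static final BigInteger MIN = BigInteger.valueOf(Long.MIN_VALUE);
        static final BigInteger MAX = BigInteger.valueOf(Long.MAX_VALUE);

        static List<BigInteger[]> split(int segmentCount) {
            BigInteger span = MAX.subtract(MIN).divide(BigInteger.valueOf(segmentCount));
            List<BigInteger[]> segments = new ArrayList<>();
            BigInteger start = MIN;
            for (int i = 0; i < segmentCount; i++) {
                // The last segment absorbs any rounding remainder.
                BigInteger end = (i == segmentCount - 1) ? MAX : start.add(span);
                segments.add(new BigInteger[] { start, end });
                start = end;
            }
            return segments;
        }

        public static void main(String[] args) {
            for (BigInteger[] seg : split(8)) {
                System.out.println("nodetool repair -pr -st " + seg[0] + " -et " + seg[1]);
            }
        }
    }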

Careful repair

Smaller intervals also mean more intervals

More intervals mean more actual repairs

Repairs need to be babysat :(


The Spotify way
Feature teams are meant to build features
Not to waste time operating their C* clusters

Cron-ing nodetool repair is no good
● mostly because there is no feedback loop

This all led to the creation of the Reaper

The Reaper
REST(ish) service

Does a lot of JMX

Orchestrates repairs for you

The reaping
You:
curl http://reaper/cluster --data '{"seedHost": "my.cassandra.host.net"}'

The Reaper:
●Figures out cluster info (e.g. name, partitioner)

The reaping
You:
curl http://reaper/repair_run --data '{"clusterName": "myCluster"}'

The Reaper:
●Prepares repair intervals

The reaping
You:
curl -X PUT http://reaper/repair_run/42 -d state=RUNNING

The Reaper:
●Starts triggering repairs of repair intervals
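
Conceptually, the triggering loop looks something like the sketch below: repair one interval at a time, retry failures, and pause between intervals. All names, the retry count, and the back-off are illustrative, not Reaper's actual internals:

    import java.util.List;
    import java.util.concurrent.TimeUnit;

    // Sketch of the triggering loop behind a repair run: one segment at a
    // time, with retries (resilience) and a pause between segments
    // (carefulness).
    public class RepairRunLoop {
        interface SegmentRepairer {
            boolean repair(long startToken, long endToken) throws Exception;
        }

        static void run(List<long[]> segments, SegmentRepairer repairer) throws Exception {
            for (long[] seg : segments) {
                for (int attempt = 1; attempt <= 3; attempt++) {  // retry when things break
                    try {
                        if (repairer.repair(seg[0], seg[1])) break;
                    } catch (Exception e) {
                        System.err.println("segment failed (attempt " + attempt + "): " + e);
                    }
                }
                TimeUnit.SECONDS.sleep(1);  // back off between segments
            }
        }

        public static void main(String[] args) throws Exception {
            // Fake repairer that always succeeds; a real one would talk JMX
            // to the replicas covering the segment.
            run(List.of(new long[] {0L, 100L}, new long[] {100L, 200L}),
                (st, et) -> { System.out.println("repaired " + st + ".." + et); return true; });
        }
    }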

Reaper’s features
Carefulness - doesn’t kill a node
● checks for node load (see the JMX sketch below)
● backs off after repairing an interval
Resilience - retries when things break
● because things break all the time
Parallelism - no idle nodes
● multiple small intervals repaired in parallel
Scheduling - set things up only once
● regular full-ring repairs
Persistence - state saved somewhere
● a bit of extra resilience
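
The carefulness check can be pictured as a JMX call like the following sketch: read Cassandra's pending-compactions gauge and back off if the node looks busy. The host, port, and threshold are made-up example values, and this is not Reaper's actual code:

    import javax.management.MBeanServerConnection;
    import javax.management.ObjectName;
    import javax.management.remote.JMXConnector;
    import javax.management.remote.JMXConnectorFactory;
    import javax.management.remote.JMXServiceURL;

    // Sketch of a "carefulness" check over JMX: inspect Cassandra's
    // pending-compactions metric before triggering the next segment.
    public class NodeLoadCheck {
        public static void main(String[] args) throws Exception {
            JMXServiceURL url = new JMXServiceURL(
                "service:jmx:rmi:///jndi/rmi://my.cassandra.host.net:7199/jmxrmi");
            try (JMXConnector connector = JMXConnectorFactory.connect(url)) {
                MBeanServerConnection mbs = connector.getMBeanServerConnection();
                ObjectName pending = new ObjectName(
                    "org.apache.cassandra.metrics:type=Compaction,name=PendingTasks");
                Number tasks = (Number) mbs.getAttribute(pending, "Value");
                if (tasks.intValue() > 20) {  // threshold is an arbitrary example
                    System.out.println("node busy (" + tasks + " pending compactions), backing off");
                } else {
                    System.out.println("node idle enough, triggering next segment");
                }
            }
        }
    }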

What we reaped
First repair done 2015-01-28
1,700 repairs since then, recently 90 per week
176,000 (16%) segments failed at least once
60 repair failures

Reaper’s Future
CASSANDRA-10070 (automatic repair scheduling built into Cassandra)

Whatever is needed until then

Greatest benefit
Cassandra Reaper automates a very tedious maintenance operation of Cassandra clusters in a smart, efficient and careful manner, while requiring minimal Cassandra expertise

github.com/spotify/cassandra-reaper

#CassandraSummit

Thank you!


github.com/spotify/cassandra-reaper

#CassandraSummit