Cost-Efficient Stream Processing with RisingWave and ScyllaDB
ScyllaDB
375 views
19 slides
Jun 19, 2024
Slide 1 of 19
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
About This Presentation
A technical look at how to achieve cost-efficient stream processing with ScyllaDB and RisingWave, an open-source distributed streaming database.
Size: 2.89 MB
Language: en
Added: Jun 19, 2024
Slides: 19 pages
Slide Content
Cost-efficient Stream Processing with RisingWave and ScyllaDB Yingjun Wu, CEO at RisingWave
Yingjun Wu Chief Everything Officer Ex-AWS Redshift Ex-IBM Research Almaden PhD in databases and stream processing Your photo goes here, smile :)
State Management: Traditional Solutions MapReduce style Coupled compute and storage architecture
State Management: Traditional Solutions MapReduce style Coupled compute and storage architecture
State Management: Cloud-based Solution Cloud-native style Decoupled compute and storage architecture
State Management: Cloud-based Solution Cloud-native style Decoupled compute and storage architecture
State Management in the cloud Consider joining two data streams Impression stream Click stream
State Management in the cloud Consider joining two data streams Impression stream Click stream
State Management in the cloud Consider joining two data streams Impression stream Click stream
State Management in the cloud Consider joining two data streams Impression stream Click stream
Tiered Storage Log structured merge tree Recent data is cached in local machine Older data is perically compacted to lower levels
Comparison
State Management: Failure Recovery
State Management: Elastic Scaling
Demo Time! Extract, Transform, and Load (ETL) data from PostgreSQL to ScyllaDB 2 lines of code! Streaming ETL
Demo Time! Perform complex computation and then deliver data to ScyllaDB Streaming analytics Materialized view 1 Materialized view 2 Materialized view 3
Demo Time! Streaming analytics create materialized view mv1 as …; c reate materialized view mv2 as …; c reate materialized view mv3 as select … from mv1, mv2 …; create sink my_scylla as select … from mv3 with (connector = ‘cassandra’, …); Computation : Sink:
Modern companies need to gain insights from real-time data RisingWave + ScyllaDB can work collaboratively to reduce stream processing cost while attaining top performance Conclusion