Cost-Efficient Stream Processing with RisingWave and ScyllaDB

ScyllaDB 375 views 19 slides Jun 19, 2024
Slide 1
Slide 1 of 19
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19

About This Presentation

A technical look at how to achieve cost-efficient stream processing with ScyllaDB and RisingWave, an open-source distributed streaming database.


Slide Content

Cost-efficient Stream Processing with RisingWave and ScyllaDB Yingjun Wu, CEO at RisingWave

Yingjun Wu Chief Everything Officer Ex-AWS Redshift Ex-IBM Research Almaden PhD in databases and stream processing Your photo goes here, smile :)

State Management: Traditional Solutions MapReduce style Coupled compute and storage architecture

State Management: Traditional Solutions MapReduce style Coupled compute and storage architecture

State Management: Cloud-based Solution Cloud-native style Decoupled compute and storage architecture

State Management: Cloud-based Solution Cloud-native style Decoupled compute and storage architecture

State Management in the cloud Consider joining two data streams Impression stream Click stream

State Management in the cloud Consider joining two data streams Impression stream Click stream

State Management in the cloud Consider joining two data streams Impression stream Click stream

State Management in the cloud Consider joining two data streams Impression stream Click stream

Tiered Storage Log structured merge tree Recent data is cached in local machine Older data is perically compacted to lower levels

Comparison

State Management: Failure Recovery

State Management: Elastic Scaling

Demo Time! Extract, Transform, and Load (ETL) data from PostgreSQL to ScyllaDB 2 lines of code! Streaming ETL

Demo Time! Perform complex computation and then deliver data to ScyllaDB Streaming analytics Materialized view 1 Materialized view 2 Materialized view 3

Demo Time! Streaming analytics create materialized view mv1 as …; c reate materialized view mv2 as …; c reate materialized view mv3 as select … from mv1, mv2 …; create sink my_scylla as select … from mv3 with (connector = ‘cassandra’, …); Computation : Sink:

Modern companies need to gain insights from real-time data RisingWave + ScyllaDB can work collaboratively to reduce stream processing cost while attaining top performance Conclusion

Stay in Touch Yingjun Wu [email protected] yingjunwu yingjunwu https://www.linkedin.com/in/yingjun-wu/ risingwave.com/slack
Tags