In today’s world it’s no longer enough to build systems that process big volumes of information. We now need applications that can handle large continuous streams of data with very low latency so we can react to the ever-changing environment around us. To efficiently handle such problems we need...
In today’s world it’s no longer enough to build systems that process big volumes of information. We now need applications that can handle large continuous streams of data with very low latency so we can react to the ever-changing environment around us. To efficiently handle such problems we need to deploy a stream processing solution. During the talk we’ll explore one of the most popular frameworks for stream processing – Apache Flink. We’ll see what unique capabilities it provides and how they apply to some real world problems. And we’ll also explore how it works under the hood and how to get the scalable and fault-tolerant stream processing that Flink provides.
Size: 2.35 MB
Language: en
Added: Dec 02, 2018
Slides: 73 pages
Slide Content
Large scale stream
processing with
Apache Flink
Nikolay Stoitsev
Sr. Software Engineer at Uber Tech Sofia
Stream Processing?
Stream Processing?
User Interaction Logs
Stream Processing?
User Interaction Logs
Application Logs
Stream Processing?
User Interaction Logs
Application Logs
Sensor Data
Stream Processing?
User Interaction Logs
Application Logs
Sensor Data
Database Commit Logs
Infinite Dataset
Producer
Stream
Producer
Stream HDFS
Producer
Stream HDFS
Hive
Producer
Stream HDFS
Hive
Big Latency
Producer
Stream
HDFS
Real-time
service
Apache Storm
storm.apache.org
High-latency & accurate
vs.
Low-latency & approximation