Big data Scratch The Surface

PredragSimic7 400 views 19 slides Feb 22, 2019
Slide 1
Slide 1 of 19
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19

About This Presentation

Mladen Antunovic - Senior Manager Big Data @ Mapp Digital GmbH


Slide Content

Big data
Scratch the surface

2
About me (https://www.linkedin.com/in/antunovic/)
Mladen Antunović
Sr. Manager, Big Data Engineering –Mapp Digital GmbH München
Work
dipl.el.ing., Faculty of Electrical Engineering. University of Sarajevo
Education
AtlantBHSarajevo (2008 -2013) –Lead Engineer
Mapp Digital (2013 -Current) –Sr. Manager, Big Data Engineering
Experience

3
What am I talking about?

4
What is Big data?
“Today, every two days we create as much
data as we did from the beginning of time until
2000.”
That’s right, every two days

5
Usage
Predictions
Simulations
Analytics
Artificial Intelligence
Machine Learning
Statistics

6

7
Basic App
Architecture
7

8
Distributed App
Architecture
8
831
065
23

9
Distributed platform architecture
Masters
Workers
Lookup (ZK)

10
Big Data = Big Problems
10
Costs
Performance
Scalability
High Availability (HA)
Maintenance (Operation)
Migration (Deployments, Upgrades, UPTIME)

@Mapp

12
Big Data Infrastructure
12
Self managed (2 Datacenters in Munich)
Open Source
No paid support
~600 Servers (Dell R4xx, R6xx, R8xx)
1 -10 Gbps Network
Private cloud + Docker
3 people (Automated)

13

14
Planning
14
Software (OS, Big Data software)
# of servers / # of racks
CPUs
Memory
Storage (SATA, SSD)
Network
Application architecture

15
Maintenance
15
Uptime SLAs
Hardware replacements
Software upgrades (before | now)
Migration
Network upgrade
Application architecture

16
Improvements
16
Monitoring
Log analysis
Metrics collection
Logging
Alerting
Machine Learning
Artificial Inteligence

17
Public vs Private Cloud
17
Costs
Scalability
Maintenance (failures, backups, replication, recovery)
Migration
Project (Phase, External/Internal)

18
Life @Mapp

1919
Q&A
Should we use Big Data technologies for every project?
Why?
Why not?