Application Monitoring using Datadog

MuktaArankalleAphale 4,281 views 24 slides Dec 14, 2015
Slide 1
Slide 1 of 24
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24

About This Presentation

Application monitoring is being talked about a lot these days and it helps provide key information that is helpful in developing better software and also in taking some key business decision. Datadog offers monitoring as a service.


Slide Content

@muktaaWe are hiring! Send resume to: [email protected]
Application Monitoring
using Datadog
MuktaAphale
(DevOps Practice Head, WhiteHedge Technologies)
12 Dec 2015, DevOps Meetup, Pune

@muktaaWe are hiring! Send resume to: [email protected]
WhiteHedge Technologies

@muktaaWe are hiring! Send resume to: [email protected]
Application Monitoring
•Ensure that a software application processes and performs in an
expected manner and scope
APM Runtime+Metrics
Health+of+the+
application
Health+of+
infrastructure
Provide+
system/application+
feedback
Continuous+
Improvement

@muktaaWe are hiring! Send resume to: [email protected]
Datadog
•Monitoring as a Service
•Agent Based
•Python
•Integrations
•Dashboards
•Tagging
•Alerts
•Checks

@muktaaWe are hiring! Send resume to: [email protected]
Environment

@muktaaWe are hiring! Send resume to: [email protected]
Challenge
•Monitor System Health
•Monitor Redis, MongoDB
•Application (API Server) runs as dockercontainer
•Monitor application performance
•Logging slowed down performance
•Rapid development, No feedback mechanism
•Insight in application needed by management
•Insight needed by support, devopsand developers

@muktaaWe are hiring! Send resume to: [email protected]
Integration Dashboards

@muktaaWe are hiring! Send resume to: [email protected]
AWS

@muktaaWe are hiring! Send resume to: [email protected]
Amazon -ElastiCache

@muktaaWe are hiring! Send resume to: [email protected]
System Overview

@muktaaWe are hiring! Send resume to: [email protected]
But…
•How to impress the management?! !
•How to deal with X environments and Y
versions?
•How to measure performance, without
affecting performance?

@muktaaWe are hiring! Send resume to: [email protected]
Overview Dashboard
Dummy
Dummy
Dummy
Dummy
Dummy

@muktaaWe are hiring! Send resume to: [email protected]
Debug: API Error Rate
Which+endpoint+is+having+more+errors?+Which+error+codes+are+being+thrown?Search+logs+in+Logglyif+needed.

@muktaaWe are hiring! Send resume to: [email protected]
DatadogTags
•Inherited from
Integrations
•Custom tags

@muktaaWe are hiring! Send resume to: [email protected]
TemplatedDashboards
•Dashboard variables
•Dynamically explore metrics

@muktaaWe are hiring! Send resume to: [email protected]
DatadogAPI
•We can code:
•Instance configuration
•Infrastructure
•Deployments
•Why not monitoring?!
•Datadoghas great API

@muktaaWe are hiring! Send resume to: [email protected]
Datadogmodule for nodejs

@muktaaWe are hiring! Send resume to: [email protected]
Measuring http response times from
application code

@muktaaWe are hiring! Send resume to: [email protected]
Plot it!
Dummy1
Dummy2
Dummy3
Dummy4
Dummy+Data

@muktaaWe are hiring! Send resume to: [email protected]
What about performance while tracking
performance?
•Metrics from code are sent to local datadogagent using UDP
•Local datadogagent syncs the metrics to the datadogserver
•Datadogdashboard reflects the metrics with some delay
•Application performance does not get affected

@muktaaWe are hiring! Send resume to: [email protected]
Alerts

@muktaaWe are hiring! Send resume to: [email protected]
DatadogChecks
•Collect metrics from datadogagent check
•Out of the box agent checks
•Custom agent checks in Python
•Interesting use cases:
•Keep alive check (Service is up)
•Network Check (HTTP, TCP)
•Validate response for expected data

@muktaaWe are hiring! Send resume to: [email protected]
Using Datadogwe could…
•Monitor dynamic infrastructure
•Monitor system health
•Monitor application availability
•Monitor application performance
•Show application & infra health graphically
•Provide feedback about health of system

@muktaaWe are hiring! Send resume to: [email protected]
Thank You!
Questions?
We are Authorized DatadogPartners!
You can write to me at:
[email protected]
Twitter: @muktaa