APEC: How to ensure your application is operated properly

alterrebe 1 views 27 slides Sep 27, 2025
Slide 1
Slide 1 of 27
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24
Slide 25
25
Slide 26
26
Slide 27
27

About This Presentation

A presentation at the DevOps Finland meetup in Helsinki in October 2023


Slide Content

APEC: How to ensure your application is operated properly? DevOps Finland meetup 10.10.2023 Uri Savelchev

Agenda The problem and a possible solution Goals we want to achieve What’s inside the checklist? APEC History in Zalando Learnings Q & A DevOps Finland meetup APEC - How to ensure your application is operated properly?

DevOps Finland meetup APEC - How to ensure your application is operated properly?

Zalando Tech 2500+ software engineers 300+ engineering teams 200+ K8s clusters 3500 active applications H ow to maintain operation excellence? DevOps Finland meetup APEC - How to ensure your application is operated properly?

DevOps Finland meetup APEC - How to ensure your application is operated properly?

Solution: a checklist? Application Production Engineering Checklist (APEC) is a set of checks that teams should have in place for all applications. The items on APEC have been identified from system reviews, post-mortem documents, existing production systems, and Zalando requirements. Together the items on the checklist form a shared understanding of what production readiness is for teams shipping to production. Outcome is a status ( GREEN , AMBER , RED ) and a list of what’s missing DevOps Finland meetup APEC - How to ensure your application is operated properly?

DevOps Finland meetup APEC - How to ensure your application is operated properly?

Operational excellence metrics DevOps Finland meetup APEC - How to ensure your application is operated properly? per team per department per organization

A tool to find gaps in operations DevOps Finland meetup APEC - How to ensure your application is operated properly? The APEC status A summary of what’s missing The review process itself: answering questions helps you to look at your application from a different angle.

DevOps Finland meetup APEC - How to ensure your application is operated properly?

APEC is NOT about compliance DevOps Finland meetup APEC - How to ensure your application is operated properly? Security, Copyright, GDPR, … need separate tools and processes.

DevOps Finland meetup APEC - How to ensure your application is operated properly? What’s inside?

APEC Sections (non-complete) Application registry information Technical stack (and if it is included in the Tech Radar ) Tier evaluation Naming and Code Of Conduct Data Grade API specification Service Level Objectives Monitoring and tracing Playbooks Production Readiness Review DevOps Finland meetup APEC - How to ensure your application is operated properly?

DevOps Finland meetup APEC - How to ensure your application is operated properly? https://opensource.zalando.com/tech-radar/

Tier evaluation DevOps Finland meetup APEC - How to ensure your application is operated properly? A way to classify the apps by their business criticality

DevOps Finland meetup APEC - How to ensure your application is operated properly?

Service Level Objectives API Specification is a contract, SLOs are an important appendix to the contract. DevOps Finland meetup APEC - How to ensure your application is operated properly?

Monitoring, tracing, playbooks Every application running in production must have: A dashboard with metrics A reasonable set of alerts Every application running in production should be instrumented with tracing (OpenTracing / OpenTelemetry) Every application being supported 24x7 must have reliable playbooks DevOps Finland meetup APEC - How to ensure your application is operated properly?

DevOps Finland meetup APEC - How to ensure your application is operated properly?

APEC History DevOps Finland meetup APEC - How to ensure your application is operated properly?

APEC History A plain document Proof of Concept Small set of teams and application to try the approach A very long form Google-form style Painful to fill out A typical review took 20-30 minutes for every app More compact form with expandable sections Autodiscovery (API specs, tracing, monitoring, playbooks,...) Memoization — no need to repeat the same answer twice A repeated review takes 1-5 minutes usually DevOps Finland meetup APEC - How to ensure your application is operated properly?

DevOps Finland meetup APEC - How to ensure your application is operated properly?

Backstage integration DevOps Finland meetup APEC - How to ensure your application is operated properly?

DevOps Finland meetup APEC - How to ensure your application is operated properly?

Learnings Engineers: “filling forms is a bureaucracy” The process should be as fast/painless as possible Memoization (DRY principle) Autodetect and autofill everything you can There are no rules without exceptions Let people to use them Comments are important! (even though they aren’t processed automatically) The “color” of the APEC should be defined by most critical things DevOps Finland meetup APEC - How to ensure your application is operated properly?

Learn more A nice presentation of Heinrich Hartmann at DevOpsCon Berlin in 2022 Henning Jacobs’ blog post Zalando engineering blog DevOps Finland meetup APEC - How to ensure your application is operated properly?

DevOps Finland meetup APEC - How to ensure your application is operated properly?