Engineering tools for making smarter decisions .pptx

Engineering tools for making smarter decisions Tamir Dresher Director of Architecture @ Payoneer @tamir_dresher

Agenda 1. Home/Office/Freelancer 2. Choosing communication Under uncertainty 3. Resource allocation 4. Choosing the best library

Head of Architecture @ Who am I? Software Engineering Lecturer Ruppin Academic Center & My Books:

Example 1 Work from home or office or take Freelancer

Example 1: Work from home or office or take Freelancer We were given a project which we estimate will take 360 Man-Days to complete 10 people team - FTEs (Full time employees) and Freelancers each with a different cost FTEs Can work at home or at the office Office has a cost and if more than 2 days, cost increase Freelancers Has lesser productivity than FTE but lower cost Cost increase if more than 8 freelancers What is the ideal combination of FTEs and Freelancers, and how many office days?

Brute force =C6*(C10+C5*C8/5)+C4*IF(C4>8,C11,C9)+IF(C5>3,20,0)*C5

Tip 1 – Name you cells Ctrl+F3 will open the Name Manager

Brute force

Brute force Let’s run over all the possible values of our variables We can run a script BUT Excel’s build-in What-If feature is much faster

Brute force Let’s run over all the possible values of our variables We can run a script BUT Excel’s build-in What-If feature is much faster Step 1 Write all values for variable 1 as rows, and all values for variable 2 as columns

Brute force Let’s run over all the possible values of our variables We can run a script BUT Excel’s build-in What-If feature is much faster Step 2 Reference you target function in the upper left cell of the table

Brute force Let’s run over all the possible values of our variables We can run a script BUT Excel’s build-in What-If feature is much faster Step 3 Mark the table Open What-If -> Data Table under the Data tab in the ribbon Then reference the cell of the variable the top row represents and the variable the left column represents

Brute force Let’s run over all the possible values of our variables We can run a script BUT Excel’s build-in What-If feature is much faster

Tip 2 – Colorize your cells Use Conditional Formatting to quickly find Minimal and Maximal values

Brute force Let’s run over all the possible values of our variables We can run a script BUT Excel’s build-in What-If feature is much faster

Example 2 Choosing the best communication method OR how to choose your next restaurant

Example 2: Choosing the best communication method We are adding a new recommendation capability to our product and the team has 2 candidate possible algorithms. We are not sure which one will perform better (which one will lead to most clicks) Naïve A/B Testing will expose each algorithm to 50% of the users for a period of time and then compare the results. But If one algorithm is clearly better, we might be losing significant revenue while continuing to show the underperforming one during the test period

Multi Armed Bandit

Multi Armed Bandit balancing exploration (trying different options) and exploitation (sticking to the best-known option).

My restaurant search Corporate headquarters A B C D

My restaurant search Corporate headquarters A B C D which one should I pick from now on?

My restaurant search Corporate headquarters A B C D Even a broken clock shows the right time twice a day

My restaurant search Corporate headquarters A B C D Multi Armed Bandit approach (Thompson sampling)

My restaurant search Corporate headquarters A B C D Multi Armed Bandit approach (Thompson sampling) 1

My restaurant search Corporate headquarters A B C D Multi Armed Bandit approach (Thompson sampling) 1 0.2 0.4 0.6 0.8 1 2 3 4 5

My restaurant search Corporate headquarters A B C D Multi Armed Bandit approach (Thompson sampling) 1 0.2 0.4 0.6 0.8 1 2 3 4 5 random.NextDouble (); -> 0.5

My restaurant search Corporate headquarters A B C D Multi Armed Bandit approach (Thompson sampling) Random sample: 3 Random sample: 2 Random sample: 4 Random sample: 1

My restaurant search Corporate headquarters A B C D Multi Armed Bandit approach (Thompson sampling)

My restaurant search Corporate headquarters A B C D Multi Armed Bandit approach (Thompson sampling) Random sample: 4 Random sample: 2 Random sample: 3 Random sample: 1

My restaurant search Corporate headquarters A B C D Multi Armed Bandit approach (Thompson sampling)

Multi Armed Bandit - summary Balancing exploration (trying new options) and exploitation (choosing the best-known option) across multiple choices (e.g., restaurants, features). The goal is to maximize reward over time by dynamically updating the selection process based on feedback. Thompson sampling – generate a random value per each item/option by using it distribution array and select the item with the highest value Beta distribution is a preferred way to do so

Example 3 SaaS expansion in the cloud and how to satisfy FinOps

Example 3: SaaS expansion in the cloud Our SaaS product becomes massively popular and has been expanded into multiple regions in the cloud. Each region has different load and base latency. We want to optimize the cost while maintaining high performance and availability to all users Decisions to make: How many instance per each resource type in each region? How to distribute the load between the regions? Constraints: Each region must provide at least 100,000 compute units (CU) Latency must stay below a certain threshold Some regions are critical and should have lower latency and availability Our budget has a limit of 15K USD

Formalize the problem (1) Region Critical Base Load Base latency (ms) Routed load to (0..1) Instances US-East US-West EU-Central Asia-Pacific #Small #Medium #Large US-East 1 4500 50 ? ? ? ? ? ? ? US-West 3200 60 ? ? ? ? ? ? ? EU-Central 1 3250 58 ? ? ? ? ? ? ? Asia-Pacific 5500 80 ? ? ? ? ? ? ? Increase Latency to virtually demand more resources Minimize Cost = (#Small us-east +…+#Small Asia )*Cost small +…+ (#Large us-east +…+#Large Asia )*Cost large

Formalize the problem (1) Region Critical Base Load Base latency (ms) Routed load to (0..1) Instances US-East US-West EU-Central Asia-Pacific #Small #Medium #Large US-East 1 4500 50 ? ? ? ? ? ? ? US-West 3200 60 ? ? ? ? ? ? ? EU-Central 1 3250 58 ? ? ? ? ? ? ? Asia-Pacific 5500 80 ? ? ? ? ? ? ? Constraints: Regions routed load sum to 1 Instance count are integers Cost<15,000$ Latency < MAX_LATENCY Each region has minimal capacity Minimize Cost = (#Small us-east +…+#Small Asia )*Cost small +…+ (#Large us-east +…+#Large Asia )*Cost large

Linear Programming/Optimization Ideal for resource allocation problems

Linear Programming/Optimization You can use Excel solver or use Google OR-Tools for .Net in your code

Linear Programming/Optimization You can use Excel solver or use Google OR-Tools for .Net in your code https://github.com/tamirdresher/EngineeringToolsForSmarterDecisions/tree/master/LinearProgramming

Linear Programming - Real world scenario How Uber Optimizes the Timing of Push Notifications using ML and Linear Programming Where x i,t is a binary indicator of whether to send push i at time t , and s i,t is the score (value) of sending push i at time t .

Example 4 Deciding which Framework/Library to use

Example 4: Deciding which Framework/Library to use You're tasked with choosing a third-party library for the upcoming project. you need to consider several factors to ensure you pick the right one from 10 potential libraries. Each library is evaluated based on these attributes: Ease of Use Performance Cost Support and Community Documentation Quality Downloads/Stars

Example 4: Deciding which Framework/Library to use Library Ease of use (1-10) Performance (TPS) Cost($) Support (1,0) Documentation Quality (1-10) Downloads Library1 5 700 500 7 1000 Library2 7 650 1200 1 6 2000 Library3 8 699 450 9 500 Library4 5 520 600 1 5 900 Library5 6 200 150 1 8 250 Library6 9 600 800 4 740 Library7 4 320 350 1 2 800 Library8 5 875 300 1 8 600 Library9 2 900 450 1 2 700 Library10 10 1000 10 1 10 1 Issues: Scale is not comparable For cost, larger value should give lower score How should we take #Downloads into account

Normalizing the data Each attribute has different scales of importance attributes like cost (which have a wider range) have a disproportionate impact on the decision min-max normalization – transform to a range of [0,1] Z-score – transforms to a range of [-1,1] min-max makes numbers fit between 0 and 1, while z-score shows how far each number is from the average, using the idea of "how spread out" the numbers are.

Normalized attributes Library Ease of use (1-10) Performance (TPS) Cost($) Support (1,0) Documentation Quality (1-10) Downloads Library1 -0.478 0.224 0.060 -1.528 0.338 0.497 Library2 0.391 0.015 2.255 0.655 -0.038 2.478 Library3 0.826 0.220 -0.097 -1.528 1.089 -0.493 Library4 -0.478 -0.529 0.373 0.655 -0.413 0.299 Library5 -0.043 -1.868 -1.038 0.655 0.714 -0.989 Library6 1.261 -0.194 1.001 -1.528 -0.789 -0.018 Library7 -0.913 -1.366 -0.411 0.655 -1.540 0.101 Library8 -0.478 0.956 -0.568 0.655 0.714 -0.295 Library9 -1.783 1.061 -0.097 0.655 -1.540 -0.097 Library10 1.696 1.480 -1.477 0.655 1.465 -1.482 =( B2 -AVERAGE( B:B ))/STDEV.P( B:B ) Cost should order should be reversed. So we will multiply by -1

Normalized attributes with cost order fix Library Ease of use (1-10) Performance (TPS) Cost($) Support (1,0) Documentation Quality (1-10) Downloads Library1 -0.478 0.224 -0.060 -1.528 0.338 0.497 Library2 0.391 0.015 -2.255 0.655 -0.038 2.478 Library3 0.826 0.220 0.097 -1.528 1.089 -0.493 Library4 -0.478 -0.529 -0.373 0.655 -0.413 0.299 Library5 -0.043 -1.868 1.038 0.655 0.714 -0.989 Library6 1.261 -0.194 -1.001 -1.528 -0.789 -0.018 Library7 -0.913 -1.366 0.411 0.655 -1.540 0.101 Library8 -0.478 0.956 0.568 0.655 0.714 -0.295 Library9 -1.783 1.061 0.097 0.655 -1.540 -0.097 Library10 1.696 1.480 1.477 0.655 1.465 -1.482

Normalized attributes with cost order fix – with avg score Library Ease of use (1-10) Performance (TPS) Cost($) Support (1,0) Documentation Quality (1-10) Downloads Library1 -0.478 0.224 -0.060 -1.528 0.338 0.497 Library2 0.391 0.015 -2.255 0.655 -0.038 2.478 Library3 0.826 0.220 0.097 -1.528 1.089 -0.493 Library4 -0.478 -0.529 -0.373 0.655 -0.413 0.299 Library5 -0.043 -1.868 1.038 0.655 0.714 -0.989 Library6 1.261 -0.194 -1.001 -1.528 -0.789 -0.018 Library7 -0.913 -1.366 0.411 0.655 -1.540 0.101 Library8 -0.478 0.956 0.568 0.655 0.714 -0.295 Library9 -1.783 1.061 0.097 0.655 -1.540 -0.097 Library10 1.696 1.480 1.477 0.655 1.465 -1.482 Score -0.16769 0.20769 0.035262 -0.13999 -0.08225 -0.37801 -0.44203 0.353136 -0.26778 0.881655

IMDB Score

Example 4: Deciding which Framework/Library to use Library Ease of use (1-10) Performance (TPS) Cost($) Support (1,0) Documentation Quality (1-10) Downloads Score (Normalized + IMDB) Library1 5 700 500 7 1000 -0.27329 Library2 7 650 1200 1 6 2000 -0.23461 Library3 8 699 450 9 500 0.1175 Library4 5 520 600 1 5 900 -0.20499 Library5 6 200 150 1 8 250 0.070731 Library6 9 600 800 4 740 -0.39643 Library7 4 320 350 1 2 800 -0.48942 Library8 5 875 300 1 8 600 0.413856 Library9 2 900 450 1 2 700 -0.26415 Library10 10 1000 10 1 10 1 0.01341 Final results

Summary 1. Home/Office/Freelancer – Brute force & What-If 2. Choosing communication Under uncertainty – Multi Armed Bandit 3. Resource allocation – Linear Optimization 4. Choosing the best library – IMDB Score https://github.com/tamirdresher/EngineeringToolsForSmarterDecisions

Engineering tools for making smarter decisions .pptx

About This Presentation

Slide Content

Tags

Categories

Download

Quick Actions

Statistics

Related Slideshows

Engineering tools for making smarter decisions .pptx

About This Presentation

Slide Content

Slide 1

Slide 2

Slide 3

Slide 4

Slide 5

Slide 6

Slide 7

Slide 8

Slide 9

Slide 10

Slide 11

Slide 12

Slide 13

Slide 14

Slide 15

Slide 16

Slide 17

Slide 18

Slide 19

Slide 20

Slide 21

Slide 22

Slide 23

Slide 24

Slide 25

Slide 26

Slide 27

Slide 28

Slide 29

Slide 30

Slide 31

Slide 32

Slide 33

Slide 34

Slide 35

Slide 36

Slide 37

Slide 38

Slide 39

Slide 40

Slide 41

Slide 42

Slide 43

Slide 44

Slide 45

Slide 46

Slide 47

Slide 48

Slide 49

Slide 50

Slide 51

Slide 52

Slide 53

Tags

Categories

Download

Quick Actions

Statistics

Related Slideshows

Pray For The Peace Of Jerusalem and You Will Prosper

Don_t_Waste_Your_Life_God.....powerpoint

VILLASUR_FACTORS_TO_CONSIDER_IN_PLATING_SALAD_10-13.pdf

Fertility awareness methods for women in the society

Chapter 5 Arithmetic Functions Computer Organisation and Architecture

syakira bhasa inggris (1) (1).pptx.......