ML @ Instacart: Improving the quality of On-demand Grocery

SharathRao6 712 views 39 slides Feb 05, 2020
Slide 1
Slide 1 of 39
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24
Slide 25
25
Slide 26
26
Slide 27
27
Slide 28
28
Slide 29
29
Slide 30
30
Slide 31
31
Slide 32
32
Slide 33
33
Slide 34
34
Slide 35
35
Slide 36
36
Slide 37
37
Slide 38
38
Slide 39
39

About This Presentation

Mitigate impact of stockouts in Grocery delivery through machine learning. Near real-time inventory prediction.


Slide Content

Sharath Rao
http://www.pivenworld.com/
ML@Instacart:
Improving Quality of
On-Demand Grocery

#Orders with “Face Masks” on Instacart




@sharathrao

01
02
03

Groceries
delivered
to your
doorstep
or ready
for pickup
from
stores you
love
in as little
as an hour
+ + + =

and many
more ….

v

Customers
Shoppers
Products
(Advertisers)
Search
Advertising
Shopping
Logistics
Customer Service
Inventory
Picking
Loyalty
Stores
(Retailers)

Select Store Build Cart
Delivery/Pickup
Time Selection

01














Long tail of infrequent occurrences with
occasional severe impact if not addressed

Stockouts Poor Replacement

Source: https://hbr.org/2004/05/stock-outs-cause-walkouts




“Don’t show
me items that
can’t be
fulfilled”
“Call/chat during
picking regarding
replacements”
“Warn me about
stock level and
let me specify the
replacement”
“Trust the
shopper to
replace”
“Do not
replace item”
More permissive
Convenience Focus
More Proactive
Precision Focus

●Item Found Rate
○% of ordered items that are fulfilled
●Replacement Satisfaction Rate
○% of replacements implicitly/explicitly approved by customer

02

In-store activity assumed as ground truth of item being in the store
Model Probability(Found) as a supervised binary classification problem

Aggregate Found Rate Statistics at higher levels (Critical for Tail Items)
●Product
●Retailer + Product
●Retailer + Product + Region
XGBoost Model with ~100 features
Example Item Level Features (Effective for Popular Items)
●Time since last not-found/found
●Found Rate in last 6m, 3m, 1 week, 1 day, 6 hours

Availability scores low across US
Even in stores where it hasn’t been bought (due to
product level features)
A store in Mission District:
Availability score: bottom 2 percentile
Past found rate: extremely low
From the Vice News report: “The milk
substitute has become so popular in places
like Brooklyn that shortages have broken out,
and a grey market for the most sought-after
brand of the stuff — Oatly — has popped up
online. Sometimes liters of Oatly can sell for
as much as $18 each.”

What comes next?
You’ve been freed
Do you know
how hard it is to lead?
-
Source: https://smcl.org/blogs/post/wait-for-it/

03

●Retailer Location Selection
●Replacement Suggestions
during fulfillment/picking
●Picking Item List Ranking
●Search and Merchandising Ranking
●Low Stock Inform and Mitigate
●Post-Checkout Experience
●Chat with shopper during
fulfillment/picking
Customer Experience Shopper Experience

●Eroding trust in Catalog
●Customer Churn
Show all relevant
items including
low stock
Remove all low
stock items
●Eroding trust in Fulfillment
●Lower shopper productivity
●Customer Churn
How can we create the optimal search experience?

Good improvement in Found Rate
Unacceptable drop in Search Conversion Rate

Hidden Items
Query: “Milk chocolate”
We were often removing items customers really wanted!

Demoted Items
No drop in Search Conversion Rate
Low improvement in Found Rate

Customers were ordering demoted items anyway!
Query: “Milk chocolate”

●Use data collected from demotion experiment
●Learn a simple model/heuristic to predict whether a
demoted item would be ordered given search
Key Insight:
Highly relevant
Query: “Milk chocolate”

Top 2 features describing search context
●# of results
●# of demoted results from top 5
# of items demoted in top 5
results
Don’t operate here
Ensure that the search results are relevant, mostly in stock and retains good selection of products

“Don’t show
me items that
can’t be
fulfilled”
“Call/chat during
picking regarding
replacements”
“Warn me about
stock level and
let me specify the
replacement”
“Trust the
shopper to
replace”
“Do not
replace item”
More permissive
Convenience Focus
More Proactive
Precision Focus

Replacement suggestions are generated by an ML recommendation system

Do not replaceMore replacement
suggestions

●Retailer Location Selection
●Replacement Suggestions
during fulfillment
●Picking Item List Ranking
●Search and Merchandising Ranking
●Low Stock Inform and Mitigate
●Post-Checkout Experience
●Chat with shopper during
fulfillment/picking
40% relative reduction
in low quality deliveries
Customer Experience Shopper Experience

●Pick problems where ML can make unique contributions
○Hiring/training product minded ML Engineers is challenging
●Pick problems that are fundamental to the business with broad applications
○Instacart examples: Item Availability, Replacement recommendation, Product Targeting
●New use cases of existing models are often higher ROI than improving the model*
●Design and PM are key partners in getting the most value out of ML efforts