Practical Lessons from Predicting Clicks on Ads at Facebook

ssuser78eda8 1,066 views 31 slides Jan 26, 2015
Slide 1
Slide 1 of 31
Slide 1
1
Slide 2
2
Slide 3
3
Slide 4
4
Slide 5
5
Slide 6
6
Slide 7
7
Slide 8
8
Slide 9
9
Slide 10
10
Slide 11
11
Slide 12
12
Slide 13
13
Slide 14
14
Slide 15
15
Slide 16
16
Slide 17
17
Slide 18
18
Slide 19
19
Slide 20
20
Slide 21
21
Slide 22
22
Slide 23
23
Slide 24
24
Slide 25
25
Slide 26
26
Slide 27
27
Slide 28
28
Slide 29
29
Slide 30
30
Slide 31
31

About This Presentation

ADKDD'14
Facebook


Slide Content

Practical Lessons from Predicting Clicks on Ads at Facebook 2014/ 1/ 27 ( Tue. ) Chang Wei-Yuan @ MakeLab Lab Meeting Facebook ADKDD ‘14

Outline Introduction Method Decision tree feature transforms Logistic regression for linear classifier Data freshness Online data joiner Experiment Conclusion Thought 2

Introduction 3

Outline Introduction Method Decision tree feature transforms Logistic regression for linear classifier Data freshness Online data joiner Experiment Conclusion Thought 4

Outline Introduction Method Decision tree feature transforms Logistic regression for linear classifier Data freshness Online data joiner Experiment Conclusion Thought 5

6

Outline Introduction Method Decision tree feature transforms Logistic regression for linear classifier Data freshness Online data joiner Experiment Conclusion Thought 7

Decision tree feature transforms 8

Outline Introduction Method Decision tree feature transforms Logistic regression for linear classifier Data freshness Online data joiner Experiment Conclusion Thought 9

Logistic regression for linear classifier Stochastic Gradient Descent (SGD) algorithm the tunable parameters are optimized by grid search Bayesian online learning scheme for profit regression 10

One advantages of LR over BOPR is that the model size is half the smaller model size may lead to better cache locality and thus faster cache lookup 11

Outline Introduction Method Decision tree feature transforms Logistic regression for linear classifier Data freshness Online data joiner Experiment Conclusion Thought 12

Data freshness Click prediction systems are often deployed in dynamic environments where the data distribution changes over time. 13

14

15 These f indings indicate that it is worth retraining on a daily basis .

Batch one option would be to have a recurring daily job that retrains the models, possibly in batch Concurrency the training can be done via concurrency in a multi-core machine with large amount of memory 16

Outline Introduction Method Decision tree feature transforms Logistic regression for linear classifier Data freshness Online data joiner Experiment Conclusion Thought 17

Online data joiner The boosted decision trees can be trained daily, but the linear classier can be trained in near real -time by online learning . O nline J oiner g enerates real -time training data used to train the linear classifier via online learning 18

Online data joiner The boosted decision trees can be trained daily, but the linear classier can be trained in near real -time by online learning . O nline J oiner g enerates real -time training data used to train the linear classifier via online learning H ow to label for a new instance ? 19

Online data joiner The boosted decision trees can be trained daily, but the linear classier can be trained in near real -time by online learning . O nline J oiner g enerates real -time training data used to train the linear classifier via online learning perform a distributed stream-to-stream join on ad impressions and ad clicks 20

21

Outline Introduction Method Decision tree feature transforms Logistic regression for linear classifier Data freshness Online data joiner Experiment Conclusion Thought 22

Experiment Number of boosting trees 23

Experiment Boosting feature importance 24

Experiment Boosting feature importance 25

Experiment Historical features 26

Experiment Historical features 27

Outline Introduction Method Decision tree feature transforms Logistic regression for linear classifier Data freshness Online data joiner Experiment Conclusion Thought 28

Conclusion This has inspired a promising hybrid model architecture for click prediction . b oosted decision trees and a linear classier online learning method with real-time training data 29

Outline Introduction Method Decision tree feature transforms Logistic regression for linear classifier Data freshness Online data joiner Experiment Conclusion Thought 30

Thanks for listening. 2014 / 1 / 27 ( Tue. ) @ MakeLab Lab Meeting [email protected]
Tags