Targeted Marketing with Amazon Machine Learning Barbara Pogorzelska, Technical Program Manager

Post on 09-Jul-2020


Page 1: Barbara Pogorzelska, Technical Program Manageraws-de-media.s3.amazonaws.com/images/Webinar/2016... · Training data 41188 data points 20 attributes binary output Batch predictions

Targeted Marketing with Amazon Machine Learning

Barbara Pogorzelska, Technical Program Manager

Agenda

• Problem description

• Downloading, editing and uploading the data

• Datasource creation

• ML model creation

• Model evaluation

• Batch prediction

• Clean up

Problem Description

Machine learning & the use case

Machine learning is a technology that automatically finds patterns in your data and uses them to make predictions for new data points as they become available.

Problem

How can potential customers be identified for targeted marketing campaigns?

Data available

A publicly available banking and marketing dataset from the University of California at Irvine (UCI) repository.

Amazon Machine Learning tutorial

Data (see http://archive.ics.uci.edu/ml/datasets/Bank+Marketing)

Training data
• 41188 data points
• 20 attributes
• binary output

Batch predictions
• 4119 data points

Bank client data
1 - age
2 - job (admin., blue-collar, entrepreneur, …)
3 - marital (single, divorced, married, …)
4 - education (basic.4y, basic.6y, university.degree, …)
5 - default: has credit in default?
6 - housing: has housing loan?
7 - loan: has personal loan?

Related to the last contact of the current campaign
8 - contact: communication type (cellular, telephone)
9 - month: last contact month of year
10 - day_of_week: last contact day of the week
11 - duration: last contact duration, in seconds

Other attributes
12 - campaign: number of contacts performed during this campaign for this client
13 - pdays: number of days since the client was last contacted in a previous campaign
14 - previous: number of contacts performed before this campaign for this client
15 - poutcome: outcome of the previous marketing campaign

Social and economic context attributes
16 - emp.var.rate: employment variation rate
17 - cons.price.idx: consumer price index
18 - cons.conf.idx: consumer confidence index
19 - euribor3m: euribor 3 month rate - daily indicator
20 - nr.employed: number of employees

Output variable (desired target)
21 - y: has the client subscribed to a term deposit?
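The attribute list above maps naturally onto an Amazon ML data schema. The following is a minimal sketch of building such a schema in Python. The attribute names are copied from the slide and must match the header of the file you actually upload; the type chosen for each attribute and the assumption that the file has a header row are illustrative choices, not something the slides state.

```python
import json

# Column names as listed on the slide, in order. The real CSV header may
# differ (for example in how dots are written), so treat these as assumptions.
COLUMN_ORDER = [
    "age", "job", "marital", "education", "default", "housing", "loan",
    "contact", "month", "day_of_week", "duration", "campaign", "pdays",
    "previous", "poutcome", "emp.var.rate", "cons.price.idx",
    "cons.conf.idx", "euribor3m", "nr.employed", "y",
]

# Assumed type assignment: counts and rates are numeric, the rest categorical.
NUMERIC = {
    "age", "duration", "campaign", "pdays", "previous", "emp.var.rate",
    "cons.price.idx", "cons.conf.idx", "euribor3m", "nr.employed",
}


def build_schema(column_order, numeric, target="y"):
    """Return a schema dict in the Amazon ML data schema layout."""
    attributes = []
    for name in column_order:
        if name == target:
            attr_type = "BINARY"  # the 1/0 output variable
        elif name in numeric:
            attr_type = "NUMERIC"
        else:
            attr_type = "CATEGORICAL"
        attributes.append({"attributeName": name, "attributeType": attr_type})
    return {
        "version": "1.0",
        "targetAttributeName": target,
        "dataFormat": "CSV",
        "dataFileContainsHeader": True,  # assumption about the uploaded file
        "attributes": attributes,
    }


schema = build_schema(COLUMN_ORDER, NUMERIC)
schema_json = json.dumps(schema, indent=2)
```

The console can infer a schema on its own; writing it out like this mainly helps when you want to review or override the inferred types.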

Three steps to create a prediction

Downloading, editing and uploading the data

Storing the data on S3

• Download from https://s3.amazonaws.com/aml-sample-data/banking.csv and https://s3.amazonaws.com/aml-sample-data/banking-batch.csv
– Replaced yes/no with 1/0
• Store data on S3
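The yes/no to 1/0 replacement mentioned above could be done along these lines if you start from a copy of the data that still contains yes/no in the target column. This is a sketch using only the Python standard library; the column name `y` comes from the attribute list, while the function name, the comma delimiter, and the tiny sample are assumptions made for illustration.

```python
import csv
import io


def binarize_target(csv_text, target="y", delimiter=","):
    """Rewrite 'yes'/'no' in the target column as '1'/'0'.

    Values that are neither 'yes' nor 'no' are left unchanged.
    """
    mapping = {"yes": "1", "no": "0"}
    reader = csv.DictReader(io.StringIO(csv_text), delimiter=delimiter)
    out = io.StringIO()
    writer = csv.DictWriter(
        out,
        fieldnames=reader.fieldnames,
        delimiter=delimiter,
        lineterminator="\n",
    )
    writer.writeheader()
    for row in reader:
        value = row[target].strip().lower()
        row[target] = mapping.get(value, row[target])
        writer.writerow(row)
    return out.getvalue()


# Hypothetical three-column excerpt, not real rows from the dataset.
sample = "age,job,y\n56,housemaid,no\n41,admin.,yes\n"
converted = binarize_target(sample)
```

For a real file you would read its contents from disk, pass them through `binarize_target`, write the result back, and then upload it to your S3 bucket.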

Datasource creation
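The slides create the datasource through the console. As a rough programmatic counterpart, the sketch below builds a request for the boto3 `machinelearning` client's `create_data_source_from_s3` call. The bucket name, the IDs, the two-attribute schema, and the 0 to 70 percent split are all hypothetical values chosen for illustration; the slides do not specify them, and the call assumes the service is available in your account.

```python
import json


def datasource_request(ds_id, name, data_s3_uri, schema, begin, end):
    """Build keyword arguments for create_data_source_from_s3."""
    rearrangement = {"splitting": {"percentBegin": begin, "percentEnd": end}}
    return {
        "DataSourceId": ds_id,
        "DataSourceName": name,
        "DataSpec": {
            "DataLocationS3": data_s3_uri,
            "DataSchema": json.dumps(schema),
            "DataRearrangement": json.dumps(rearrangement),
        },
        # Statistics are needed when the datasource is used for training.
        "ComputeStatistics": True,
    }


def create_datasource(request):
    """Send the request; requires boto3 and AWS credentials."""
    import boto3  # third-party, imported lazily so the sketch loads without it

    client = boto3.client("machinelearning")
    return client.create_data_source_from_s3(**request)


# Hypothetical, shortened schema and names, for illustration only.
tiny_schema = {
    "version": "1.0",
    "targetAttributeName": "y",
    "dataFormat": "CSV",
    "dataFileContainsHeader": True,
    "attributes": [
        {"attributeName": "age", "attributeType": "NUMERIC"},
        {"attributeName": "y", "attributeType": "BINARY"},
    ],
}
training_request = datasource_request(
    "ds-banking-train", "Banking training data",
    "s3://my-example-bucket/banking.csv", tiny_schema, 0, 70,
)
```

A second request with `begin=70` and `end=100` would describe the held-out portion used for evaluation under the same assumed split.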

ML model creation

Model creation – default model
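For readers who prefer the API to the console, a model request might look like the sketch below. It targets the boto3 `machinelearning` client's `create_ml_model` call. The IDs and the parameter values are hypothetical; omitting `Parameters` leaves the service defaults in place, which roughly corresponds to the default model, while passing them corresponds to a custom model.

```python
def ml_model_request(model_id, name, training_datasource_id, parameters=None):
    """Build keyword arguments for create_ml_model (binary classification)."""
    request = {
        "MLModelId": model_id,
        "MLModelName": name,
        "MLModelType": "BINARY",
        "TrainingDataSourceId": training_datasource_id,
    }
    if parameters:
        # The API expects parameter values as strings.
        request["Parameters"] = {key: str(value) for key, value in parameters.items()}
    return request


def create_model(request):
    """Send the request; requires boto3 and AWS credentials."""
    import boto3  # third-party, imported lazily

    return boto3.client("machinelearning").create_ml_model(**request)


# Default model: no training parameters supplied.
default_request = ml_model_request(
    "ml-banking-default", "Banking default model", "ds-banking-train"
)

# Custom model: illustrative values only, not the ones used in the webinar.
custom_request = ml_model_request(
    "ml-banking-custom", "Banking custom model", "ds-banking-train",
    parameters={"sgd.maxPasses": 20, "sgd.l2RegularizationAmount": 1e-6},
)
```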

Model evaluation

Model evaluation – default model
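Amazon ML summarizes the quality of a binary model with the AUC metric. To make that number concrete, here is a small self-contained sketch that computes AUC by pairwise comparison of positive and negative examples. The labels and scores are made up for illustration and do not come from the banking dataset or from the evaluation shown in the webinar.

```python
def auc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    Counts the fraction of (positive, negative) pairs in which the positive
    example receives the higher score; ties count as half.
    """
    positives = [s for label, s in zip(labels, scores) if label == 1]
    negatives = [s for label, s in zip(labels, scores) if label == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical evaluation data: 1 = subscribed, 0 = did not subscribe.
example_labels = [1, 0, 1, 0]
example_scores = [0.9, 0.8, 0.7, 0.1]
example_auc = auc(example_labels, example_scores)  # 3 of 4 pairs ordered correctly
```

A value of 0.5 corresponds to random ordering and 1.0 to a perfect ranking, which is why the metric is useful for comparing the default and custom models.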

Data Insights

Model creation – custom model

Batch predictions

Batch predictions - default model
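A batch prediction for a binary model produces a CSV with a `bestAnswer` and a `score` column. The sketch below shows one way to pick marketing targets from such output by applying your own score threshold. It assumes the output has already been downloaded and decompressed, and that rows appear in the same order as the input file; the sample values and the function name are hypothetical.

```python
import csv
import io


def select_targets(predictions_csv, threshold=0.5):
    """Return zero-based row indexes whose score meets the threshold."""
    reader = csv.DictReader(io.StringIO(predictions_csv))
    return [
        index
        for index, row in enumerate(reader)
        if float(row["score"]) >= threshold
    ]


# Made-up scores for three customers, not real prediction output.
sample_output = "bestAnswer,score\n0,0.0603\n1,0.81\n0,0.44\n"
default_targets = select_targets(sample_output)        # threshold 0.5
wider_targets = select_targets(sample_output, 0.4)     # lower threshold
```

Lowering the threshold contacts more customers at the cost of more false positives, so the right value depends on what a wasted contact costs relative to a missed subscriber.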

Batch predictions - custom model

Clean up

Clean up your account

To delete the input data used for the training, evaluation, and batch prediction steps:
1. Open the Amazon S3 console.
2. Navigate to the S3 bucket where you stored banking.csv and banking-batch.csv.
3. Select the two files and the .writePermissionCheck.tmp file.
4. Choose Actions, Delete.
5. When prompted for confirmation, choose OK.

To delete the predictions generated by the batch prediction step:
1. Open the Amazon S3 console.
2. Navigate to the bucket where you stored the output of the batch predictions.
3. Select the batch-prediction folder.
4. Choose Actions, Delete.
5. When prompted for confirmation, choose OK.
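The console steps for removing the input files could also be scripted. The following sketch builds a request for the boto3 S3 client's `delete_objects` call covering the three files named above. The bucket name is hypothetical, and the files are assumed to sit at the top level of the bucket; removing the batch-prediction folder would additionally require listing the keys under that prefix, which is not shown here.

```python
INPUT_KEYS = ["banking.csv", "banking-batch.csv", ".writePermissionCheck.tmp"]


def delete_request(bucket, keys):
    """Build keyword arguments for the S3 delete_objects call."""
    return {
        "Bucket": bucket,
        "Delete": {"Objects": [{"Key": key} for key in keys]},
    }


def delete_inputs(request):
    """Send the request; requires boto3 and AWS credentials."""
    import boto3  # third-party, imported lazily

    return boto3.client("s3").delete_objects(**request)


# Hypothetical bucket name, for illustration only.
cleanup_request = delete_request("my-example-bucket", INPUT_KEYS)
```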

Try out machine-learning-samples from GitHub

Get Started on AWS with Amazon Machine Learning

https://aws.amazon.com/de/machine-learning/