dynamic customer segment analysis and behavior prediction using data mining
DESCRIPTION
National Tsing Hua University Department of Industrial Engineering and Engineering Management. Dynamic customer segment analysis and behavior prediction using data mining. Group 1: Margaret Dlamini Saumen Bhaumick Daniel Chen Ricky Huang July Panoso. Abstract. - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/1.jpg)
Dynamic customer segment analysis and behavior prediction using data mining
Group 1:Margaret Dlamini
Saumen BhaumickDaniel ChenRicky HuangJuly Panoso
National Tsing Hua UniversityDepartment of Industrial Engineering and
Engineering Management
![Page 2: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/2.jpg)
Abstract
CRM is mainly to Understand customer well By Studying the difference between the
Customers through customer segmentation. Track customers shift from segment to
segment Discover customer segment knowledge Predict Customer segments behavior pattern
![Page 3: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/3.jpg)
CRM
We believe keeping and managing the customer is most important:
•Attractive Personalized services to satisfy Customer needs
•CRM- Closer and deeper relationships with customers
![Page 4: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/4.jpg)
Understanding Customers.
• Analyzing Customers Information.
•Differentiate Customers through Segmentation
• Increase Customer loyalty through Customized products
Predict Customers Purchase behavior
![Page 5: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/5.jpg)
Contact and Serve CustomersThrough Channels
To understand customers its essential to integrate the data collected thru.
Web browsing Purchase behavior Complaints Demographics
![Page 6: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/6.jpg)
THE DATA
The Customer segments and related knowledge discovered from multiple data sources change as Customer base changes
Thus valid for a particular period Most existing predictions methods
fundamentally are based on numerical and historical data patterns (using simple regression or neural network techniques)
![Page 7: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/7.jpg)
FLUCTUATIONS
This can be quite fluctuating caused due to
Promotions New product launching Customer support policies
![Page 8: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/8.jpg)
Customer Segment
This study tracks the customer shift among customer segments
Monitor changes overtime To discover customer segment
knowledge Predict Customer’s segment
behavior pattern
![Page 9: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/9.jpg)
Prediction on Customers behavior
By studying the segment shift each customer might shift
Build a career path of each customer
By aggregating each customers career path, derive the Dominant career paths (majority of customers follow)
![Page 10: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/10.jpg)
Process to Segment Customers
Choose a basis of segmentation, with appropriate variables (demographic or behavioral)
Use a multivariate analysis to group together or split customers.
Evaluate and validate the outputs. Analyze the results in economic
terms
![Page 11: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/11.jpg)
Segmentation Design schemes
Measure used for segmentation Number of resulting segments View about the change overtime Segmentation techniques used Number of the customers
selected
![Page 12: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/12.jpg)
Segmentation MeasuresThe segmentation variables consists of one or a
combination of the following Demographic Geographic Psychographic or BehavioralThe behavioral purchase pattern can beRFM (Recency, Frequency and Monetary)FRAT (Frequency, Recency, Amount & Type)
![Page 13: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/13.jpg)
Number of Resulting Segments
Minimize combined direct and opportunity cost of the Segmentation as critera for optimum number of segments
Allow the derivation of equal sized segments
Judgmental decisions are on the basis of number of segments
![Page 14: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/14.jpg)
View about change overtime
Through occasion based design that assumes that people vary in their needs across occasions of product purchase.
Other way is to consider time-segmented customers through repeated measurements of the same customer at different point in times
![Page 15: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/15.jpg)
Segmentation TechniquesStatistical Methods K -mean algorithm Discriminant Analysis Logistic RegressionMachine learning Techniques Neural Networks (Normally its considered that
neural network are more accurate compared to statistical methods)
![Page 16: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/16.jpg)
Number of Customers
The Customer segmentation can incorporate all the customers or can be limited to sample of them.
If the segmentation is based on sample, its important to predict how many customer falls in that group (Via inferential statistics)
![Page 17: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/17.jpg)
Profitability
Predict changes in the segment to derive static characteristic of the segment
Changes in the segment closely relates to increase or decrease in profitability obtained from the segment
![Page 18: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/18.jpg)
Research Overview This study focus on behavioral variables inc
lude customer’s product usage. Recency, Frequency, Monetary (RFM) analys
is. Self-Organizing Map (SOM) : uses neural clu
stering method to divided the retailer’s customer into numerous groups.
![Page 19: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/19.jpg)
Cont. This paper collect data from July 2001 to
September 2002. Segment customers five times during
fifteen months One quarter is a time window to create
new segmentation.
![Page 20: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/20.jpg)
Cont. Individual career path: present a single
customer’s history of shifts. Dominant career path: a descriptive
pattern, which explains common histories most customer might follow.
One leading to a loyal segment and the other leading to a vulnerable segment.
This study also provide a analytical method for predicting time-variant segment movement a customer might show.
![Page 21: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/21.jpg)
SEGMENTING CUSTOMERS
We should be use a clustering analysis of product usage or purchase. Purchase transactions
have four features:
Customer number or customer ID
Recency value Frequency value Monetary value
![Page 22: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/22.jpg)
Data preparation for the segmentation
We have 3 situations:
Newcomers (don’t have any purchase before period t)
Old customers (but made purchase during period t)
Old customers (but don’t make purchase during period t)
How can we calculate RFM?
![Page 23: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/23.jpg)
Newcomers (do not have any purchase history before period t)
rt = measures how long they made purchaseft = measures how frequently they make purchasemt = measures how much money t
hey spend
![Page 24: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/24.jpg)
Old customers (but made purchase during period t)
Rt-1 - rt = Recency value for period tFt-1 - ft = Frequency value for period tMt-1 - mt = Monetary value for period t
Note. Rt-1, Ft-1 ,Mt-1, stand for cumulative to period t-1
![Page 25: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/25.jpg)
Old customers (but don’t make purchase during period t)
Rt-1 + 3 months = Recency value for period t
Ft-1 + 3 months = Frequency value for period t
Mt-1 + 3 months = Monetary value for period t
![Page 26: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/26.jpg)
Self-organization of customersThe SOM does unsupervised clustering
Records within a group or cluster tend to be similar to each other
Records in different groups are dissimilar
The SOM will end up with a few output units:
- Strong units- Weak units
The strong Units represent probable cluster centers
![Page 27: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/27.jpg)
Segmentation results
2 techniques to speed up the SOM:
It is to vary the size of the neighborhoods: From large to small
The other is to have the winning neuron use a larger learning rate than that of the neighboring neurons
![Page 28: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/28.jpg)
Summary of customer statistics per quarter
![Page 29: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/29.jpg)
Summary of customer segment characteristics for the third quarter of 2001
Loyal
Vulnerable
Newcomer
![Page 30: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/30.jpg)
Result of the successive five-time segmentation
![Page 31: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/31.jpg)
Discovering individual career path and dominant career path
Five-time segmentation makes it possible to combine segment shift histories into a career path.
Natural life cycle Migration External factors
![Page 32: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/32.jpg)
Changes in segments
Over successive quarters there are changes in the number of
customers in a segment indicating certain strategies that
management should review for the CRM
![Page 33: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/33.jpg)
To Q4 2001 From Q3 2001 R↓F↑M↑ R↓F↓M↓ R↑F↓M↓ Customer Before Shifts
R↓F↑M↑ 24,577 2,267 3,952 30,796
R↓F↓M↓ 5,472 16,181 14,563 36,216
R↑F↓M↓ 2,778 9,387 17,788 29,953
R↑F↑M↑ 461 148 282 891
Customers afters shifts 33,288 27,983 36,585 97,856
Segment shifts of customers from Q3 2001 to Q4 2001
![Page 34: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/34.jpg)
Path No.of customers Probability (%)
R↓F↑M↑→ R↓F↑M↑→ R↓F↑M↑ 20,495 42.0
R↓F↓M↓→ R↑F↓M↓→ R↓F↑M↑ 5,658 11.6
R↑F↓M↓→ R↓F↑M↑→ R↓F↑M↑ 3,386 6.9
R↑F↓M↓→ R↓F↑M↑→ R↓F↑M↑ 2,999 6.1
R↓F↓M↑→ R↓F↑M↑→ R↓F↑M↑ 2,457 5.0
Dominant career paths of length 3, which lead to segment R↓F↑M↑
![Page 35: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/35.jpg)
Dominant Career Paths of length 5, which lead to segment R↑F↓M↓
Path No.of customers Probability
(%)
R↑F↓M↓→R↑F↓M↓→ R↑F↓M↓→ R↑F↓M↓→ R↑F↓M↓ 8,645 20.90
R↓F↓M↓→ R↑F↓M↓→ R↑F↓M↓→ R↑F↓M↓→ R↑F↓M↓ 5,460 13.20
R↓F↓M↓→ R↓F↓M↓→ R↑F↓M↓→ R↑F↓M↓→ R↑F↓M↓ 2,010 4.86
R↑F↓M↓→ R↓F↓M↓→ R↑F↓M↓→ R↑F↓M↓→ R↑F↓M↓ 1,924 4.65
R↓F↑M↑→ R↑F↓M↓→ R↑F↓M↓ → R↑F↓M↓→ R↑F↓M↓ 875 2.11
![Page 36: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/36.jpg)
Predicting Career Paths
Prediction of customer’s segment shifts can be classified as a classification task from the data mining perspective.
This case study use a decision tree induction technique and choose C5.0 to predict the time-variant career paths.
![Page 37: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/37.jpg)
Decision Tree Induction Technique
The C5.0 algorithm has a special method form improving its accuracy rate called boosting.
Boosting working by building mutiple models in a seqience.
The next tree is used to modify and improve the previous one.
![Page 38: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/38.jpg)
Data Preparation for the Prediction
The case generate 6 models for categorical predictions.
Choose the best model with the highest accuracy.
![Page 39: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/39.jpg)
Training six prediction modelsQuarter/
ModelQ3 2001 Q4 2001 Q1 2002 Q2 2002 Q3 2002
PMa Attribute Attribute Class
PMb Attribute Attribute Class
PMc Attribute Attribute Class
PMd Attribute Attribute Attribute Class
PMe Attribute Attribute Attribute Class
PMf Attribute Attribute Attribute Attribute Class
![Page 40: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/40.jpg)
Summary of the Prediction Accuracy of C5.0 Models
Model No. of attributes
Pruning severity
Prediction Accuracy (%)
PMa 2 75 59.74
PMb 2 80 61.68
PMc 2 70 71.27
PMd 3 94 62.28
PMe 3 78 71.38
PMf 4 75 71.13
![Page 41: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/41.jpg)
Prediction Accuracy Statistics for Best Model, PMe
Predicted Values at Q4 2002
Actual Values
at Q4 2002
R↓F↑M↑ R↑F↓M↓ Total
R↓F↓M↓ 2102 2009 4111
R↓F↓M↑ 3907 3071 6978
R↓F↑M↑ 36286 9165 45451
R↑F↓M↓ 8183 33133 41316
Total 50478 47378 97856
![Page 42: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/42.jpg)
Prediction Accuracy Statistics for Best Model, PMe
Predicted Values at Q4 2002
Actual Values
at Q4 2002
R↓F↑M↑ R↑F↓M↓ Total
R↓F↓M↓ 2102 2009 4111
R↓F↓M↑ 3907 3071 6978
R↓F↑M↑ 36286 9165 45451
R↑F↓M↓ 8183 33133 41316
Total 50478 47378 97856
Newcomer Segment
![Page 43: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/43.jpg)
Prediction Accuracy Statistics for Best Model, PMe
Predicted Values at Q4 2002
Actual Values
at Q4 2002
R↓F↑M↑ R↑F↓M↓ Total
R↓F↓M↓ 2102 2009 4111
R↓F↓M↑ 3907 3071 6978
R↓F↑M↑ 36286 9165 45451
R↑F↓M↓ 8183 33133 41316
Total 50478 47378 97856
Total Predict Accuracy (36286+ 33133) / 97856 *100% = 71%
R↓F↑M↑ Predict Accuracy (36286) / 50478 *100% = 72%
R↑F↓M↓ Predict Accuracy (33133) / 47378 *100% = 70%
![Page 44: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/44.jpg)
Performance evaluation of PMe Model Because the training set contains only
a few cases about the newcomer segments(7.8%), the model PMe could hardly learn the pattern about them.
Accuracy predictions for rare categories will earn a higher performance evaluation.
![Page 45: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/45.jpg)
Conclusion This paper have proposed segment-based
knowledge discovery method used for derivation of the descriptive pattern: predict the path customer will shift.
Try to resolve the fundamental problems : changing characteristics of customer in segment and change in its composition.
![Page 46: Dynamic customer segment analysis and behavior prediction using data mining](https://reader036.vdocuments.site/reader036/viewer/2022062721/5681378a550346895d9f2551/html5/thumbnails/46.jpg)
Cont. Further research Extend the prediction accuracy
Using neural network Building a separate classifier for
different segments and combining result from multiple classifier.