![Page 1: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/1.jpg)
1
An Excel-based Data Mining Tool
Chapter 4
![Page 2: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/2.jpg)
2
4.1 The iData Analyzer
![Page 3: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/3.jpg)
3
Data
PreProcessor
Interface
HeuristicAgent
NeuralNetworks
LargeDataset
ESX
MiningTechnique
GenerateRules
RulesRuleMaker
ReportGenerator
ExcelSheets
Explaination
Yes
No
No
Yes
Yes
No
![Page 4: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/4.jpg)
4
![Page 5: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/5.jpg)
5
4.2 ESX: A Multipurpose Tool for Data Mining
![Page 6: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/6.jpg)
6
ESX
• Supports supervised learning and unsupervised clustering
• Does not make statistical assumptions
• Deal with missing attribute values
• Applied to categorical and numerical data
• Point out inconsistencies and unusual values
![Page 7: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/7.jpg)
7
• For supervised classification, ESX can determine those instances and attributes best able to classify new instances
• For unsupervised clustering, ESX incorporates a globally optimizing evaluation function that encourages a best instance clustering
![Page 8: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/8.jpg)
8
Root
CnC1 C2
I11 I1jI12
Root Level
Instance Level
Concept Level
. . .
. . .
I21 I2kI22
. . . In1 InlIn2
. . .
![Page 9: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/9.jpg)
9
4.3 iDAV Format for Data Mining
![Page 10: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/10.jpg)
10
Table 4.1 • Credit Card Promotion Database: iDAV Format
Income Magazine Watch Life Insurance Credit CardRange Promotion Promotion Promotion Insurance Sex Age
C C C C C C RI I I I I I I
40–50K Yes No No No Male 4530–40K Yes Yes Yes No Female 4040–50K No No No No Male 4230–40K Yes Yes Yes Yes Male 4350–60K Yes No Yes No Female 3820–30K No No No No Female 5530–40K Yes No Yes Yes Male 3520–30K No Yes No No Male 2730–40K Yes No No No Male 4330–40K Yes Yes Yes No Female 4140–50K No Yes Yes No Female 4320–30K No Yes Yes No Male 2950–60K Yes Yes Yes No Female 3940–50K No Yes No No Male 5520–30K No No Yes Yes Female 19
![Page 11: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/11.jpg)
11
Table 4.2 • Values for Attribute Usage
Character Usage
I The attribute is used as an input attribute.
U The attribute is not used. D The attribute is not used for classification or clustering, but
attribute value summary information is displayed in all output reports.
O The attribute is used as an output attribute. For supervised learning with ESX, exactly one categorical attribute is selected as the output attribute.
![Page 12: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/12.jpg)
12
4.4 A Five-step Approach for Unsupervised Clustering
Step 1: Enter the Data to be Mined
Step 2: Perform a Data Mining Session
Step 3: Read and Interpret Summary Results
Step 4: Read and Interpret Individual Class Results
Step 5: Visualize Individual Class Rules
![Page 13: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/13.jpg)
13
Step 1: Enter The Data To Be Mined
![Page 14: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/14.jpg)
14
![Page 15: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/15.jpg)
15
Step 2: Perform A Data Mining Session
![Page 16: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/16.jpg)
16
![Page 17: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/17.jpg)
17
![Page 18: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/18.jpg)
18
Step 3: Read and Interpret Summary Results
• Class Resemblance Scores• Domain Resemblance Score
–Attributes, instances, no model• Domain Predictability
![Page 19: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/19.jpg)
19
![Page 20: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/20.jpg)
20
![Page 21: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/21.jpg)
21
Step 4: Read and Interpret Individual Class Results
• Class Predictability is a within-class measure.
• Class Predictiveness is a between- class measure.
![Page 22: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/22.jpg)
22
![Page 23: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/23.jpg)
23
![Page 24: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/24.jpg)
24
Step 5: Visualize Individual Class Rules
![Page 25: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/25.jpg)
25
![Page 26: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/26.jpg)
26
4.5 A Six-Step Approach for Supervised Learning
Step 1: Choose an Output Attribute
Step 2: Perform the Mining Session
Step 3: Read and Interpret Summary Results
Step 4: Read and Interpret Test Set Results
Step 5: Read and Interpret Class Results
Step 6: Visualize and Interpret Class Rules
![Page 27: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/27.jpg)
27
Read and Interpret Test Set Results
![Page 28: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/28.jpg)
28
4.6 Techniques for Generating Rules
• 1. Choose an attribute
• 2. use the attribute to subdivide instances into classes
• 3. – if the instances in the subclass satisfy a
predefined criteria, generate a defining rule– If not, repeat 1
![Page 29: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/29.jpg)
29
4.6 Techniques for Generating Rules
1. Define the scope of the rules.
2. Choose the instances.
3. Set the minimum rule correctness.
4. Define the minimum rule coverage.
5. Choose an attribute significance value.
![Page 30: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/30.jpg)
30
![Page 31: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/31.jpg)
31
4.7 Instance Typicality
![Page 32: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/32.jpg)
32
Typicality Scores
• Identify prototypical and outlier instances.
• Select a best set of training instances.
• Used to compute individual instance classification confidence scores.
![Page 33: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/33.jpg)
33
![Page 34: 1 An Excel-based Data Mining Tool Chapter 4. 2 4.1 The iData Analyzer](https://reader035.vdocuments.site/reader035/viewer/2022062320/56649d785503460f94a5ba1b/html5/thumbnails/34.jpg)
34
4.8 Special Considerations and Features
• Avoid Mining Delays
• The Quick Mine Feature
• Erroneous and Missing Data