
Page 1: Decision Tree Learning

Presented by Ping Zhang
Nov. 26th, 2007

Page 2: Introduction

Decision tree learning is one of the most widely used and practical methods for inductive inference.

Decision tree learning is a method for approximating discrete-valued target functions, in which the learned function is represented by a decision tree.

Decision tree learning is robust to noisy data and capable of learning disjunctive expressions.

Page 3: Decision tree representation

Decision trees classify instances by sorting them down the tree from the root to some leaf node, which provides the classification of the instance.

Each node in the tree specifies a test of some attribute of the instance, and each branch descending from that node corresponds to one of the possible values for this attribute.

Page 4: Decision Tree for PlayTennis
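The tree diagram on this slide was not preserved in the transcript. As a stand-in, here is the well-known PlayTennis tree from Mitchell's textbook (which this deck follows), written as nested conditionals; it shows the sort-down-from-the-root classification just described. A minimal sketch, assuming Mitchell's tree:

```python
def play_tennis(outlook, humidity, wind):
    """Classify one instance by sorting it from the root test
    (Outlook) down to a leaf (the standard Mitchell tree)."""
    if outlook == "Sunny":
        return "No" if humidity == "High" else "Yes"
    elif outlook == "Overcast":
        return "Yes"                 # Overcast is a leaf: always play
    elif outlook == "Rain":
        return "No" if wind == "Strong" else "Yes"
    raise ValueError("unknown Outlook value: " + outlook)

print(play_tennis("Sunny", "High", "Weak"))   # -> No
print(play_tennis("Rain", "Normal", "Weak"))  # -> Yes
```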

Page 5: When to Consider Decision Trees

- Instances describable by attribute-value pairs
- Target function is discrete valued
- Disjunctive hypothesis may be required
- Possibly noisy training data

Examples (classification problems):
- Equipment or medical diagnosis
- Credit risk analysis

Page 6: Top-Down Induction of Decision Trees
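The slide's algorithm body was not preserved. Below is a minimal sketch of the ID3-style top-down loop it names: grow the tree greedily, testing at each node the attribute with the highest information gain. Function and data-layout choices here are mine, not the deck's:

```python
import math
from collections import Counter

# Minimal ID3-style sketch (illustrative; not the deck's own pseudocode).
# Each example is a dict mapping attribute names to values, plus a
# "label" key holding the classification.

def entropy(examples):
    """Entropy of the class labels in a set of examples."""
    counts = Counter(e["label"] for e in examples)
    total = len(examples)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())

def gain(examples, attr):
    """Expected reduction in entropy from partitioning on attr."""
    remainder = 0.0
    for value in {e[attr] for e in examples}:
        subset = [e for e in examples if e[attr] == value]
        remainder += len(subset) / len(examples) * entropy(subset)
    return entropy(examples) - remainder

def id3(examples, attributes):
    """Grow a tree top-down, greedily maximizing information gain."""
    labels = [e["label"] for e in examples]
    if len(set(labels)) == 1:        # pure node: make a leaf
        return labels[0]
    if not attributes:               # nothing left to test: majority leaf
        return Counter(labels).most_common(1)[0][0]
    best = max(attributes, key=lambda a: gain(examples, a))
    return {best: {value: id3([e for e in examples if e[best] == value],
                              [a for a in attributes if a != best])
                   for value in {e[best] for e in examples}}}
```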

Page 7: Entropy (1)
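The formula on this slide was lost in extraction; the standard definition (as in Mitchell) for a sample S with a boolean classification is Entropy(S) = -(p+) log2(p+) - (p-) log2(p-), where p+ and p- are the proportions of positive and negative examples in S. Entropy is 0 for a pure sample and 1 when the two classes are evenly split.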

Page 8: Entropy (2)
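Again the slide's formula was lost; the standard generalization to c classes is Entropy(S) = sum over i = 1..c of -p_i log2(p_i), where p_i is the proportion of S belonging to class i.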

Page 9: Information Gain
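The slide's formula was lost; the standard definition is Gain(S, A) = Entropy(S) - sum over v in Values(A) of (|S_v| / |S|) * Entropy(S_v), i.e. the expected reduction in entropy from partitioning S on attribute A. The gain function in the sketch after Page 6 computes exactly this quantity.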

Page 10: Training Examples
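The table itself did not survive the transcript. This walkthrough follows Mitchell's textbook, whose standard PlayTennis training set is the likely content:

Day  Outlook   Temperature  Humidity  Wind    PlayTennis
D1   Sunny     Hot          High      Weak    No
D2   Sunny     Hot          High      Strong  No
D3   Overcast  Hot          High      Weak    Yes
D4   Rain      Mild         High      Weak    Yes
D5   Rain      Cool         Normal    Weak    Yes
D6   Rain      Cool         Normal    Strong  No
D7   Overcast  Cool         Normal    Strong  Yes
D8   Sunny     Mild         High      Weak    No
D9   Sunny     Cool         Normal    Weak    Yes
D10  Rain      Mild         Normal    Weak    Yes
D11  Sunny     Mild         Normal    Strong  Yes
D12  Overcast  Mild         High      Strong  Yes
D13  Overcast  Hot          Normal    Weak    Yes
D14  Rain      Mild         High      Strong  No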

Page 11: Selecting the Next Attribute
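The computed values on this slide were lost. For the training set above, the gains in Mitchell's walkthrough are Gain(S, Outlook) = 0.246, Gain(S, Humidity) = 0.151, Gain(S, Wind) = 0.048, and Gain(S, Temperature) = 0.029, so Outlook is chosen as the root test.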

Page 12: Which attribute should be tested here?
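This slide continues the example one level down, at the Outlook = Sunny branch. In Mitchell's walkthrough, Gain(S_sunny, Humidity) = 0.970, Gain(S_sunny, Temperature) = 0.570, and Gain(S_sunny, Wind) = 0.019, so Humidity is tested next.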

Page 13: Hypothesis Space Search by ID3

- The hypothesis space is complete: the target function is surely in there
- ID3 outputs only a single hypothesis
- No backtracking, so it can get stuck in local minima
- Statistically based search choices make it robust to noisy data
- Inductive bias: "prefer the shortest tree"

Page 14: From ID3 to C4.5

C4.5 made a number of improvements to ID3. Some of these are:
- Handling both continuous and discrete attributes
- Handling training data with missing attribute values
- Handling attributes with differing costs
- Pruning trees after creation

Page 15: Overfitting in Decision Trees
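The slide's body (in Mitchell's treatment, the accuracy-versus-tree-size curves) was not preserved. The standard definition it builds on: a hypothesis h overfits the training data if some alternative h' fits the training examples less well, yet performs better over the full distribution of instances; growing a tree to purity on noisy data produces exactly this effect.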

Page 16: Reduced-Error Pruning
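The slide's body was not preserved. Reduced-error pruning, as presented in Mitchell: hold out a validation set; consider each decision node as a candidate for pruning, where pruning replaces the subtree rooted at the node with a leaf labeled by the majority class of its training examples; prune a node only if the resulting tree performs no worse on the validation set; repeat greedily until further pruning hurts validation accuracy.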

Page 17: Rule Post-Pruning

- Convert the tree to an equivalent set of rules
- Prune each rule by removing any precondition whose removal improves the rule's estimated accuracy
- Sort the pruned rules by their estimated accuracy, and consider them in this sequence when classifying subsequent instances

Perhaps the most frequently used method.
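As an example of the conversion step, the leftmost path of the PlayTennis tree above becomes the rule IF (Outlook = Sunny) AND (Humidity = High) THEN PlayTennis = No; pruning then considers dropping each of the two preconditions in turn.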

Page 18: Continuous Valued Attributes

Create a discrete attribute to test a continuous one by choosing a threshold.

In the example there are two candidate thresholds. The information gain can be computed for each of the candidate attributes, Temperature > 54 and Temperature > 85, and the best can be selected (Temperature > 54).
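The example table this refers to was lost in the transcript. In Mitchell's version, the Temperature values 40, 48, 60, 72, 80, 90 carry PlayTennis labels No, No, Yes, Yes, Yes, No, giving the two midpoint thresholds above: (48+60)/2 = 54 and (80+90)/2 = 85. A minimal sketch of that candidate-threshold search (variable names are illustrative, not from the deck):

```python
# Candidate thresholds for a continuous attribute: midpoints between
# adjacent (sorted) values where the class label changes.
# Values and labels below are Mitchell's Temperature example.

temps  = [40, 48, 60, 72, 80, 90]
labels = ["No", "No", "Yes", "Yes", "Yes", "No"]

pairs = sorted(zip(temps, labels))
candidates = [
    (lo + hi) / 2
    for (lo, l1), (hi, l2) in zip(pairs, pairs[1:])
    if l1 != l2                      # label changes across the boundary
]
print(candidates)  # -> [54.0, 85.0]
```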

Page 19: Attributes with Many Values

Problem: if an attribute has many values, Gain will select it. Imagine using the attribute Date: it would have the highest information gain of any attribute, since each date uniquely identifies one training example and every partition is perfectly pure. But the resulting decision tree is not useful, because it cannot generalize beyond the training data.
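The standard remedy, which the slide likely went on to name (it is C4.5's, per Mitchell), is to penalize many-valued attributes: SplitInformation(S, A) = -sum over i of (|S_i|/|S|) log2(|S_i|/|S|), where S_1..S_c are the partitions of S induced by the c values of A, and then to select attributes by GainRatio(S, A) = Gain(S, A) / SplitInformation(S, A).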

Page 20: Missing Attribute Values
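The slide's body was not preserved. The standard strategies (per Mitchell) when an example at node n is missing a value for attribute A: assign it the most common value of A among the training examples at n; assign the most common value among the examples at n that share its classification; or, as C4.5 does, split the example into fractional pieces weighted by the observed value frequencies and pass the fractions down the corresponding branches.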

Page 21: Attributes with Costs

Consider medical diagnosis, where the BloodTest attribute costs $150. How can we learn a consistent tree with low expected cost?
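One standard answer, assuming the slide followed Mitchell here, is to fold cost into the selection measure, e.g. Tan and Schlimmer's Gain^2(S, A) / Cost(A), or Nunez's (2^Gain(S, A) - 1) / (Cost(A) + 1)^w, where w in [0, 1] controls how heavily cost is weighted.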

Page 22: Conclusion

Decision tree learning:
- Is simple to understand and interpret
- Requires little data preparation
- Can handle both numerical and categorical data
- Uses a white-box model
- Allows a model to be validated using statistical tests
- Is robust and performs well on large data sets in a short time