comp 328: midterm review spring 2010 nevin l. zhang department of computer science & engineering...

COMP 328: Midterm Review

Spring 2010

Nevin L. Zhang

Department of Computer Science & Engineering

The Hong Kong University of Science & Technology

http://www.cse.ust.hk/~lzhang/

Can be used as cheat sheet




Overview

Algorithms for supervised learning Decision trees

Naïve Bayes classifiers

Neural networks

Instance-based learning

Support vector machines

General issues regarding supervised learning Classification error and confidence interval

Bias-Variance tradeoff

PAC learning theory

Supervised Learning

Decision Trees

Decision trees

Reduced-Error Pruning

Decision Trees

Issues with attributes Continuous

Attributes with many values Use GainRatio instead of Gain

Missing values

Tree construction is a search process Local minimum

Naïve Bayes Classifier

Can classify using this rule:

But, joint too expensive to get

Naïve Bayes Classifier

Learning Naïve Bayes Classifier

Laplace smoothing Continuous attribute When independence not true, double counting of evidence Generalization: Bayesian networks

Neural Networks

For classification and regression

Neural Networks

Activation function Step, sign

Sigmoid, tanh (hyperbolic tangent)

Neural Network/Properties

Perceptrons are linear classifier

Two-layer network with enough perceptron units can

represent all Boolean functions

One layer with enough sigmoid units can approximate any

functions well

Neural Network

Converge only when linearly separable

Neural Network

Adaline learning: Delta rule

Neural Network

Instance-Based Learning

Lazy learning K-NN

Distance-weighted k-NN (kernel regression)

Locally weighted regression

Support Vector Machines

SVM

Data not linearly separable

Nonlinear SVM

Impact of σ and C

Classifier Evaluation

Relationship between

Algorithm Evaluation/Model Selection

Which learning algorithm to use? Given algorithm, which model to use? (How many hidden units?)

Algorithm Evaluation/Model Selection

Bias-Variance Decomposition

Bias-Variance Tradeoff

For classification problem also

PAC Learning Theory

Probably approximate correct (PAC)

Relationship between

PAC Learning Theory

VC Dimension

Sample Complexity

comp 328: midterm review spring 2010 nevin l. zhang department of computer science & engineering...

Documents

regression slide

separable slide

separable page

c page

supervised learning

neural networks page

neural network page

decision trees page